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PREFACE 


Unlike Newton’s mechanics, or Maxwell’s electrodynamics, or Einstein’s relativity, 
quantum theory was not created—or even definitively packaged—by one individual, 
and it retains to this day some of the scars of its exhilirating but traumatic youth. 
There is no general consensus as to what its fundamental principles are, how it should 
be taught, or what it really “means.” Every competent physicist can “do” quantum 
mechanics, but the stories we tell ourselves about what we are doing are as various 
as the tales of Scheherazade, and almost as implausible. Richard Feynman (one of 
its greatest practitioners) remarked, “I think I can safely say that nobody understands 
quantum mechanics.” 

The purpose of this book is to teach you how to do quantum mechanics. Apart 
from some essential background in Chapter 1, the deeper quasi-philosophical ques¬ 
tions are saved for the end. I do not believe one can intelligently discuss what quantum 
mechanics means until one has a firm sense of what quantum mechanics does. But if 
you absolutely cannot wait, by all means read the Afterword immediately following 
Chapter 1. 

Not only is quantum theory conceptually rich, it is also technically difficult, 
and exact solutions to all but the most artificial textbook examples are few and far 
between. It is therefore essential to develop special techniques for attacking more 
realistic problems. Accordingly, this book is divided into two parts 1 ; Part I covers 
the basic theory, and Part II assembles an arsenal of approximation schemes, with 
illustrative applications. Although it is important to keep the two parts logically 
separate, it is not necessary to study the material in the order presented here. Some 
instructors, for example, may wish to treat time-independent perturbation theory 
immediately after Chapter 2. 


'This structure was inspired by David Park’s classic text Introduction to the Quantum Theory, 3rd 
ed„ (New York: McGraw-Hill, 1992). 
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This book is intended for a one-semester or one-year course at the junior or 
senior level. A one-semester course will have to concentrate mainly on Part I; a 
full-year course should have room for supplementary material beyond Part II. The 
reader must be familiar with the rudiments of linear algebra, complex numbers, and 
calculus up through partial derivatives; some acquaintance with Fourier analysis and 
the Dirac delta function would help. Elementary classical mechanics is essential, of 
course, and a little electrodynamics would be useful in places. As always, the more 
physics and math you know the easier it will be, and the more you will get out of your 
study. But I would like to emphasize that quantum mechanics is not, in my view, 
something that flows smoothly and naturally from earlier theories. On the contrary, 
it represents an abrupt and revolutionary departure from classical ideas, calling forth 
a wholly new and radically counterintuitive way of thinking about the world. That, 
indeed, is what makes it such a fascinating subject. 

At first glance, this book may strike you as forbiddingly mathematical. We en¬ 
counter Legendre, Hermite, and Laguerre polynomials, spherical harmonics, Bessel, 
Neumann, and Hankel functions, Airy functions, and even the Riemann Zeta function 
—not to mention Fourier transforms, Hilbert spaces, Hermitian operators, Clebsch- 
Gordan coefficients, and Lagrange multipliers. Is all this baggage really necessary? 
Perhaps not, but physics is like carpentry: Using the right tool makes the job easier, 
not more difficult, and teaching quantum mechanics without the appropriate mathe¬ 
matical equipment is like asking the student to dig a foundation with a screwdriver. 
(On the other hand, it can be tedious and diverting if the instructor feels obliged to 
give elaborate lessons on the proper use of each tool. My own instinct is to hand the 
students shovels and tell them to start digging. They may develop blisters at first, but I 
still think this is the most efficient and exciting way to learn.) At any rate, I can assure 
you that there is no deep mathematics in this book, and if you run into something 
unfamiliar, and you don’t find my explanation adequate, by all means ask someone 
about it, or look it up. There are many good books on mathematical methods—I par¬ 
ticularly recommend Mary Boas, Mathematical Methods in the Physical Sciences, 
2nd ed., Wiley, New York (1983), and George Arfken, Mathematical Methods for 
Physicists, 3rd ed., Academic Press, Orlando (1985). But whatever you do, don’t let 
the mathematics—which, for us, is only a tool—interfere with the physics. 

Several readers have noted that there are fewer worked examples in this book 
than is customary, and that some important material is relegated to the problems. This 
is no accident. I don’t believe you can learn quantum mechanics without doing many 
exercises for yourself. Instructors should, of course, go over as many problems in 
class as time allows, but students should be warned that this is not a subject about 
which anyone has natural intuitions—you’re developing a whole new set of muscles 
here, and there is simply no substitute for calisthenics. Mark Semon suggested that I 
offer a “Michelin Guide” to the problems, with varying numbers of stars to indicate 
the level of difficulty and importance. This seemed like a good idea (though, like the 
quality of a restaurant, the significance of a problem is partly a matter of taste); I have 
adopted the following rating scheme: 
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* an essential problem that every reader should study; 

** a somewhat more difficult or more peripheral problem; 

* * * an unusually challenging problem that may take over an hour. 

(No stars at all means fast food: OK if you’re hungry, but not very nourishing.) Most 
of the one-star problems appear at the end of the relevant section; most of the three-star 
problems are at the end of the chapter, A solution manual is available (to instructors 
only) from the publisher. 

I have benefited from the comments and advice of many colleagues, who sug¬ 
gested problems, read early drafts, or used a preliminary version in their courses. I 
would like to thank in particular Burt Brody (Bard College), Ash Carter (Drew Uni¬ 
versity), Peter Codings (Swarthmore College), Jeff Dunham (Middlebury College), 
Greg Elliott (University of Puget Sound), Larry Hunter (Amherst College), Mark 
Semon (Bates College), Stavros Theodorakis (University of Cyprus), Dan Velleman 
(Amherst College), and all my colleagues at Reed College. 

Finally, I wish to thank David Park and John Rasmussen (and their publishers) 
for permission to reproduce Figure 8.6, which is taken from Park’s Introduction to the 
Quantum Theory (footnote 1), adapted from I. Perlman and J. O. Rasmussen, “Alpha 
Radioactivity,” in Encyclopedia of Physics, vol. 42, Springer-Verlag, 1957. 




PARTI 


THEORY 




CHAPTER 1 


THE WAVE FUNCTION 


1.1 THE SCHRODINGER EQUATION 


Imagine a particle of mass m, constrained to move along the x-axis, subject to some 
specified force F(x, t) (Figure 1.1). The program of classical mechanics is to deter¬ 
mine the position of the particle at any given time: x(t). Once we know that, we can 
figure out the velocity (u = dx/dt), the momentum ( p = mv), the kinetic energy 
(T = (1/2 )mv 2 ), or any other dynamical variable of interest. And how do we go 
about determining x{t)l We apply Newton’s second law: F = ma. (For conservative 
systems—the only kind we shall consider, and, fortunately, the only kind that occur 
at the microscopic level—the force can be expressed as the derivative of a potential 
energy function, 1 F = —dVjdx, and Newton’s law reads m d 2 xjdt 2 = —dV/dx.) 
This, together with appropriate initial conditions (typically the position and velocity 
at t = 0), determines x(f). 

Quantum mechanics approaches this same problem quite differently. In this 
case what we’re looking for is the wave function, 'I'Or, t), of the particle, and we get 
it by solving the Schrodinger equation: 


3 ^ 
3 1 


h 2 3 2 ^ 
2m 3x 2 


+ VV. 


[ 1 . 1 ] 


'Magnetic forces are an exception, but let’s not worry about them just yet. By the way, we shall 
assume throughout this book that the motion is nonrelativistic (v <5C c). 
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x(t) 


V~U 


~F(x,t) 


Figure 1.1: A “particle” constrained to move in one dimension under the influ¬ 
ence of a specified force. 


Here i is the square root of —1, and h is Planck’s constant—or rather, his original 
constant ( h ) divided by 2;r: 

h = — = 1.054573 x 1(T 34 J s. [1.2] 

27T 

The Schrodinger equation plays a role logically analogous to Newton’s second law: 
Given suitable initial conditions [typically, 4*(x, 0)], the Schrodinger equation de¬ 
termines W(x, t) for all future time, just as, in classical mechanics, Newton’s law 
determines x(0 for all future time. 


1.2 THE STATISTICAL INTERPRETATION 


But what exactly is this “wave function”, and what does it do for you once you’ve got 
it? After all, a particle, by its nature, is localized at a point, whereas the wave function 
(as its name suggests) is spread out in space (it’s a function of x, for any given time 
0- How can such an object be said to describe the state of a particle ? The answer is 
provided by Bom’s statistical interpretation of the wave function, which says that 
I'l'fx, t)\ 2 gives the probability of finding the particle at point x, at time t —or, more 
precisely, 2 


I \ 2 d —\ probability of finding the particle 
X ’ 2 X between x and (x + dx), at time t. 


[1.3] 


For the wave function in Figure 1.2, you would be quite likely to find the particle in 
the vicinity of point A, and relatively unlikely to find it near point B. 

The statistical interpretation introduces a kind of indeterminacy into quantum 
mechanics, for even if you know everything the theory has to tell you about the 


2 The wave function itself is complex, but |4'| 2 = (where 4 1 * is the complex conjugate of 40 
is real and nonnegative—as a probability, of course, must be. 
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Figure 1.2: A typical wave function. The particle would be relatively likely to be 
found near A, and unlikely to be found near B. The shaded area represents the 
probability of finding the particle in the range dx. 


particle (to wit: its wave function), you cannot predict with certainty the outcome 
of a simple experiment to measure its position—all quantum mechanics has to offer 
is statistical information about the possible results. This indeterminacy has been 
profoundly disturbing to physicists and philosophers alike. Is it a peculiarity of 
nature, a deficiency in the theory, a fault in the measuring apparatus, or whatl 

Suppose I do measure the position of the particle, and I find it to be at the point 
C. Question: Where was the particle just before I made the measurement? There 
are three plausible answers to this question, and they serve to characterize the main 
schools of thought regarding quantum indeterminacy: 

1. The realist position: The particle was at C. This certainly seems like a 
sensible response, and it is the one Einstein advocated. Note, however, that if this is 
true then quantum mechanics is an incomplete theory, since the particle really was at 
C, and yet quantum mechanics was unable to tell us so. To the realist, indeterminacy 
is not a fact of nature, but a reflection of our ignorance. As d’Espagnat put it, “the 
position of the particle was never indeterminate, but was merely unknown to the 
experimenter.” 3 Evidently ^ is not the whole story—some additional information 
(known as a hidden variable) is needed to provide a complete description of the 
particle. 

2. The orthodox position: The particle wasn’t really anywhere. It was the act 
of measurement that forced the particle to “take a stand” (though how and why it 
decided on the point C we dare not ask). Jordan said it most starkly: “Observations 
not only disturb what is to be measured, they produce it. ... We compel [the particle] 
to assume a definite position.” 4 This view (the so-called Copenhagen interpretation) 
is associated with Bohr and his followers. Among physicists it has always been the 


3 Bemard d’Espagnat, The Quantum Theory and Reality, Scientific American, Nov. 1979 
(Vol. 241), p. 165. 

4 Quoted in a lovely article by N. David Mermin, Is the moon there when nobody looks?. Physics 
Today, April 1985, p. 38. 
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most widely accepted position. Note, however, that if it is correct there is something 
very peculiar about the act of measurement—something that over half a century of 
debate has done precious little to illuminate. 

3. The agnostic position: Refuse to answer. This is not quite as silly as it 
sounds—after all, what sense can there be in making assertions about the status of 
a particle before a measurement, when the only way of knowing whether you were 
right is precisely to conduct a measurement, in which case what you get is no longer 
“before the measurement”? It is metaphysics (in the perjorative sense of the word) to 
worry about something that cannot, by its nature, be tested. Pauli said, “One should 
no more rack one’s brain about the problem of whether something one cannot know 
anything about exists all the same, than about the ancient question of how many angels 
are able to sit on the point of a needle.” 5 For decades this was the “fail-back” position 
of most physicists: They’d try to sell you answer 2, but if you were persistent they’d 
switch to 3 and terminate the conversation. 

Until fairly recently, all three positions (realist, orthodox, and agnostic) had 
their partisans. But in 1964 John Bell astonished the physics community by showing 
that it makes an observable difference if the particle had a precise (though unknown) 
position prior to the measurement. Bell’s discovery effectively eliminated agnosticism 
as a viable option, and made it an experimental question whether 1 or 2 is the correct 
choice. I’ll return to this story at the end of the book, when you will be in a better 
position to appreciate Bell’s theorem; for now, suffice it to say that the experiments 
have confirmed decisively the orthodox interpretation 6 : A particle simply does not 
have a precise position prior to measurement, any more than the ripples on a pond do; 
it is the measurement process that insists on one particular number, and thereby in a 
sense creates the specific result, limited only by the statistical weighting imposed by 
the wave function. 

But what if I made a second measurement, immediately after the first? Would I 
get C again, or does the act of measurement cough up some completely new number 
each time? On this question everyone is in agreement: A repeated measurement (on 
the same particle) must return the same value. Indeed, it would be tough to prove that 
the particle was really found at C in the first instance if this could not be confirmed 
by immediate repetition of the measurement. How does the orthodox interpretation 
account for the fact that the second measurement is bound to give the value C? 
Evidently the first measurement radically alters the wave function, so that it is now 
sharply peaked about C (Figure 1.3). We say that the wave function collapses upon 
measurement, to a spike at the point C ('F soon spreads out again, in accordance with 
the Schrodinger equation, so the second measurement must be made quickly). There 


5 Quoted by Mermin (previous footnote), p. 40. 

6 This statement is a little too strong: There remain a few theoretical and experimental loopholes, 
some of which I shall discuss in the Afterword. And there exist other formulations (such as the many 
worlds interpretation) that do not fit cleanly into any of my three categories. But I think it is wise, at least 
from a pedagogical point of view, to adopt a clear and coherent platform at this stage, and worry about the 
alternatives later. 
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Figure 1.3: Collapse of the wave function: graph of \V\ 2 immediately after a 
measurement has found the particle at point C. 


are, then, two entirely distinct kinds of physical processes: “ordinary” ones, in which 
the wave function evolves in a leisurely fashion under the Schrodinger equation, and 
“measurements”, in which 'I' suddenly and discontinuously collapses. 7 


1.3 PROBABILITY _._ 

Because of the statistical interpretation, probability plays a central role in quantum 
mechanics, so I digress now for a brief discussion of the theory of probability. It is 
mainly a question of introducing some notation and terminology, and I shall do it in 
the context of a simple example. 

Imagine a room containing 14 people, whose ages are as follows: 

one person aged 14 
one person aged 15 
three people aged 16 
two people aged 22 
two people aged 24 
five people aged 25. 

If we let N(j) represent the number of people of age j, then 


7 The role of measurement in quantum mechanics is so critical and so bizarre that you may well 
be wondering what precisely constitutes a measurement. Does it have to do with the interaction between 
a microscopic (quantum) system and a macroscopic (classical) measuring apparatus (as Bohr insisted), 
or is it characterized by the leaving of a permanent “record” (as Heisenberg claimed), or does it involve 
the intervention of a conscious “observer” (as Wigner proposed)? I’ll return to this thorny issue in the 
Afterword; for the moment let’s take the naive view: A measurement is the kind of thing that a scientist 
does in the laboratory, with rulers, stopwatches, Geiger counters, and so on. 
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Figure 1.4: Histogram showing the number of people, N(j), with age j, for the 
example in Section 1.3. 


7/(14) = 1 
# (15) = 1 
N(16) = 3 
N(22) = 2 
TV (24) = 2 
7/(25) = 5 

while 7/(17), for instance, is zero. The total number of people in the room is 


OO 

N='£N(j). [1.4] 

j -0 

(In this instance, of course, N = 14.) Figure 1.4 is a histogram of the data. The 
following are some questions one might ask about this distribution. 

Question 1. If you selected one individual at random from this group, what is 
the probability that this person’s age would be 15? Answer: One chance in 14, since 
there are 14 possible choices, all equally likely, of whom only one has this particular 
age. If P(j) is the probability of getting age j, then P(14) = 1/14, />(15) = 
1/14, /’(lb) = 3/14, and soon. In general, 


PU) = 


N(j) 


[1.5] 


Notice that the probability of getting either 14 or 15 is the sum of the individual 
probabilities (in this case, 1/7). In particular, the sum of all the probabilities is 1— 
you’re certain to get some age: 


E 


;=i 


P(j) = 1. 


[1.6] 
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Question 2. What is the most probable age? Answer: 25, obviously; five 
people share this age, whereas at most three have any other age. In general, the most 
probable j is the j for which P(j) is a maximum. 

Question 3. What is the median age? Answer: 23, for 7 people are younger 
than 23, and 7 are older. (In general, the median is that value of j such that the 
probability of getting a larger result is the same as the probability of getting a smaller 
result.) 

Question 4. What is the average (or mean) age? Answer: 

(14) + (15) + 3(16) + 2(22) + 2(24) + 5(25) = 294 = ^ 

14 14 

In general, the average value of j (which we shall write thus: (j)) is given by 

o> = £^ = !></)• [i-7i 

j =o 

Notice that there need not be anyone with the average age or the median age—in this 
example nobody happens to be 21 or 23. In quantum mechanics the average is usually 
the quantity of interest; in that context it has come to be called the expectation value. 
It’s a misleading term, since it suggests that this is the outcome you would be most 
likely to get if you made a single measurement ( that would be the most probable 
value, not the average value)—but I’m afraid we’re stuck with it. 

Question 5. What is the average of the squares of the ages? Answer: You 
could get 14 2 = 196, with probability 1/14, or 15 2 = 225, with probability 1/14, or 
16 2 = 256, with probability 3/14, and so on. The average, then, is 

u 2 ) = T,j 2p u)- a-*] 

j =o 

In general, the average value of some function of j is given by 

</(/>> = E/o>'’O')- i 1 - 9 ] 

j =0 

(Equations 1.6, 1.7, and 1.8 are, if you like, special cases of this formula.) Beware: 
The average of the squares ((j 2 )) is not ordinarily equal to the square of the average 
({j) 2 ). For instance, if the room contains just two babies, aged 1 and 3, then (x 2 ) = 5, 
but {x} 2 = 4. 

Now, there is a conspicuous difference between the two histograms in Figure 
1.5, even though they have the same median, the same average, the same most prob¬ 
able value, and the same number of elements: The first is sharply peaked about the 
average value, whereas the second is broad and flat. (The first might represent the 
age profile for students in a big-city classroom, and the second the pupils in a one- 
room schoolhouse.) We need a numerical measure of the amount of “spread” in a 
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Figure 1.5: Two histograms with the same median, same average, and same 
most probable value, but different standard deviations. 


distribution, with respect to the average. The most obvious way to do this would be 
to find out how far each individual deviates from the average, 

N = j - (j), [UO] 

and compute the average of A j. Trouble is, of course, that you get zero, since, by the 
nature of the average, A j is as often negative as positive: 

< A /> = £o - u))P(i) - Y.j p o - u) £ pu) 

= (/> - (j) = o. 

(Note that {j) is constant—it does not change as you go from one member of the 
sample to another—so it can be taken outside the summation.) To avoid this irritating 
problem, you might decide to average the absolute value of A j. But absolute values 
are nasty to work with; instead, we get around the sign problem by squaring before 
averaging: 

<t 2 = ((A j) 2 ). [1.11] 

This quantity is known as the variance of the distribution; a itself (the square root 
of the average of the square of the deviation from the average—gulp!) is called the 
standard deviation. The latter is the customary measure of the spread about (j). 
There is a useful little theorem involving standard deviations: 

a 2 = ((A;) 2 } = ^(AjfPU) = £y - (j)) 2 P<J) 

- £(/ - 2 j{j) + (j) 2 )P(j) 

= £ J 2 P(j) - 2</> £ jP(j) + uY £ PU) 

= U 2 )-2{j)U) + U) 2 . 


or 

O' 2 = </> - O') 2 . [1.12] 

Equation 1.12 provides a faster method for computing o : Simply calculate ( j 2 ) and 
(j) 2 , and subtract. Incidentally, I warned you a moment ago that {j 2 } is not, in general, 
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equal to { j) 2 . Since o 2 is plainly nonnegative (from its definition in Equation 1.11), 
Equation 1.12 implies that 

if) > U) 2 , [1-13] 

and the two are equal only when o = 0, which is to say, for distributions with no 
spread at all (every member having the same value). 

So far, I have assumed that we are dealing with a discrete variable—that is, one 
that can take on only certain isolated values (in the example, j had to be an integer, 
since I gave ages only in years). But it is simple enough to generalize to continuous 
distributions. If I select a random person off the street, the probability that her age is 
precisely 16 years, 4 hours, 27 minutes, and 3.3333 seconds is zero. The only sensible 
thing to speak about is the probability that her age lies in some interval— say, between 
16 years, and 16 years plus one day. If the interval is sufficiently short, this probability 
is proportional to the length of the interval. For example, the chance that her age is 
between 16 and 16 plus two days is presumably twice the probability that it is between 
16 and 16 plus one day. (Unless, I suppose, there was some extraordinary baby boom 
16 years ago, on exactly those days—in which case we have chosen an interval too 
long for the rule to apply. If the baby boom lasted six hours, we’ll take intervals of a 
second or less, to be on the safe side. Technically, we’re talking about infinitesimal 
intervals.) Thus 

I probability that individual (chosen at random) 
lies between x and (x + dx ) 


J = p(x) dx. [1.14] 


The proportionality factor, p(x), is often loosely called “the probability of getting 
x,” but this is sloppy language; a better term is probability density. The probability 
that x lies between a and b (a finite interval) is given by the integral of p(x): 


Pab = 


i: 


p(x) dx. 


[1-15] 


and the rules we deduced for discrete distributions translate in the obvious way: 


p(x)dx = 1, 


[1-16] 


(x> = 

p+oc 

1 xp(x)dx, 

1 -00 

[1.17] 

</(*)) = 

J 

p+oo 

/ f(x)p(x)dx, 

‘-00 

[1.18] 


Or 2 = ((Ax) 2 ) = <X 2 ) - <X) 2 . 


[1.19] 
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^Problem 1.1 For the distribution of ages in the example in Section 1.3, 

(a) Compute (j 2 ) and (j) 2 . 

(b) Determine A j for each j , and use Equation 1.11 to compute the standard devi¬ 
ation. 

(c) Use your results in (a) and (b) to check Equation 1.12. 


Problem 1.2 Consider the first 25 digits in the decimal expansion of n (3, 1, 4, 1, 
5,9,...). 

(a) If you selected one number at random from this set, what are the probabilities 
of getting each of the 10 digits? 

(b) What is the most probable digit? What is the median digit? What is the average 
value? 

(c) Find the standard deviation for this distribution. 


Problem 1.3 The needle on a broken car speedometer is free to swing, and bounces 
perfectly off the pins at either end, so that if you give it a flick it is equally likely to 
come to rest at any angle between 0 and tt. 

(a) What is the probability density, p {9) ? [p(9)d9 is the probability that the needle 
will come to rest between 9 and (9 + d9 ).] Graph p{9) as a function of 9, from 
—tt/ 2 to 3 tt/ 2. (Of course, part of this interval is excluded, so p is zero there.) 
Make sure that the total probability is 1. 

(b) Compute (9), {9 Z ), and cr for this distribution. 

(c) Compute (sin 9), (cos 9), and <cos 2 0). 


Problem 1.4 We consider the same device as the previous problem, but this time 
we are interested in the x-coordinate of the needle point—that is, the “shadow”, or 
“projection”, of the needle on the horizontal line. 

(a) What is the probability density p(x)l [p(x)dx is the probability that the pro¬ 
jection lies between x and (x + dx).] Graph p(x) as a function of x, from —2 r 
to +2r, where r is the length of the needle. Make sure the total probability is 1. 
t Hint: You know (from Problem 1.3) the probability that 9 is in a given range; 
the question is, what interval dx corresponds to the interval d91] 

(b) Compute (x), (x 2 ), and o for this distribution. Explain how you could have 
obtained these results from part (c) of Problem 1.3. 
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** Problem 1.5 A needle of length / is dropped at random onto a sheet of paper ruled 
with parallel lines a distance / apart. What is the probability that the needle will cross 
a line? [Hint: Refer to Problem 1.4.] 

❖Problem 1.6 Consider the Gaussian distribution 

p{x) = Ae~ x{x ~ a)2 , 

where A, a, and A are constants. (Look up any integrals you need.) 

(a) Use Equation 1.16 to determine A. 

(b) Find (x), {x 2 }, and a. 

(c) Sketch the graph of p(x). 


1.4 NORMALIZATION 


We return now to the statistical interpretation of the wave function (Equation 1.3), 
which says that |vp(*, t) | 2 is the probability density for finding the particle at point x, 
at time t. It follows (Equation 1.16) that the integral of |4>| 2 must be 1 (the particle’s 
got to be somewhere): 


/: 


\V(x,t)\ 2 dx = 1. 


[ 1 . 20 ] 


Without this, the statistical interpretation would be nonsense. 

However, this requirement should disturb you: After all, the wave function is 
supposed to be determined by the Schrodinger equation—we can’t impose an extrane¬ 
ous condition on 'P without checking that the two are consistent. A glance at Equation 
1.1 reveals that if vp(x, t) is a solution, so too is Aty(x, t ), where A is any (complex) 
constant. What we must do, then, is pick this undetermined multiplicative factor so as 
to ensure that Equation 1.20 is satisfied. This process is called normalizing the wave 
function. For some solutions to the Schrodinger equation, the integral is infinite ; in 
that case no multiplicative factor is going to make it 1. The same goes for the trivial 
solution = 0. Such non-normalizable solutions cannot represent particles, and 
must be rejected. Physically realizable states correspond to the “square-integrable” 
solutions to Schrodinger’s equation. 8 


Evidently fi'O, t) must go to zero faster than 1 /Vfx 7 , as |x| -> oo. Incidentally, normalization 
only fixes the modulus of A ; the phase remains undetermined. However, as we shall see, the latter carries 
no physical significance anyway. 
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But wait a minute! Suppose I have normalized the wave function at time t = 0. 
How do I know that it will stay normalized, as time goes on and 4* evolves? (You can’t 
keep re normalizing the wave function, for then A becomes a function of t, and you 
no longer have a solution to the Schrodinger equation.) Fortunately, the Schrodinger 
equation has the property that it automatically preserves the normalization of the wave 
function—without this crucial feature the Schrodinger equation would be incompat¬ 
ible with the statistical interpretation, and the whole theory would crumble. So we’d 
better pause for a careful proof of this point: 

d f +0 ° /*+°° 3 

-J \V{x,t)\ 2 dx = J -\V(x,t)\ 2 dx. [1.21] 


[Note that the integral is a function only of t, so I use a total derivative ( d / dt ) in the 
first term, but the integrand is a function of x as well as t, so it’s a partial derivative 
(3/3 1) in the second one.] By the product rule. 


3 , 3 * * 34 > 3 ^* 

_i*i> = 


Now the Schrodinger equation says that 


dV 

~dt 


m d 2 v i ¥ 

2m dx 2 h 


and hence also (taking the complex conjugate of Equation 1.23) 
3^* in 3 2| F* 


3/ 


- + - W* 
2m dx 2 n 


[ 1 . 22 ] 


[1-23] 


[1.24] 


S' 


,,,2 >»/., a 2 '*' a ra /..jii 

f l '*' 1 - 2m V dx 2 dx 2 V “ 3* [ 2 m l* d7 - 17 )J ’ [125] 

The integral (Equation 1.21) can now be evaluated explicitly: 

d f + °° 2 M /d* 34/* \i+°o 

— |^(x,f)l dx = — ( V*- -—'FI • [1-26] 

dt J-oq 2m \ dx dx / I-00 

But ^(x, /) must go to zero as x goes to (±) infinity—otherwise the wave function 
would not be normalizable. It follows that 


d 

It 



\V(x,t)\ 2 dx 


= 0 , 


[1.27] 


and hence that the integral on the left is constant (independent of time); if 4* is 
normalized at t = 0, it stays normalized for all future time. QED 
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Problem 1.7 At time t — 0 a particle is represented by the wave function 

{ Ax/a, if 0 < x < a, 

A(b — x)/(b — a), if a < x < b, 

0, otherwise, 

where A, a, and b are constants. 

(a) Normalize 'l' (that is, find A in terms of a and b). 

(b) Sketch 'k (x, 0) as a function of x. 

(c) Where is the particle most likely to be found, at t = 0? 

<d) What is the probability of finding the particle to the left of a? Check your result 
in the limiting cases b — a and b — 2a. 

(e) What is the expectation value of x? 

* Problem 1.8 Consider the wave function 

^{x,t)=Ae~ x ^e~ iujt , 

where A, A, and uj are positive real constants. [We’ll see in Chapter 2 what potential 
( V ) actually produces such a wave function.] 

(a) Normalize 

(b) Determine the expectation values of x and x 2 . 

(c) Find the standard deviation of x. Sketch the graph of |^| 2 , as a function of x , 
and mark the points ({x) + a) and {(x) — a) to illustrate the sense in which a 
represents the “spread” in x. What is the probability that the particle would be 
found outside this range? 

Problem 1.9 Let Pab{t ) be the probability of finding the particle in the range 
(a < x < b), at time t. 


(a) Show that 


where 


-J± = J(a,t) - J(b,t) 


J(X, t ) : 


ih 

2m 


(* 


d 

dx 



What are the units of J(x, t)? [J is called the probability current, because 
it tells you the rate at which probability is “flowing” past the point x. If Pab{t) 
is increasing, then more probability is flowing into the region at one end than 
flows out at the other.] 
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(b) Find the probability current for the wave function in the previous problem. (This 
is not a very pithy example, I’m afraid; we’ll encounter some more substantial 
ones in due course.) 

**Problem 1.10 Suppose you wanted to describe an unstable particle that sponta¬ 
neously disintegrates with a “lifetime” r. In that case the total probability of finding 
the particle somewhere should not be constant, but should decrease at (say) an expo¬ 
nential rate: 

/ +OC 

\V(x,t)\ 2 dx =e~ t/T . 

■oo 

A crude way of achieving this result is as follows. In Equation 1.24 we tacitly assumed 
that V (the potential energy) is real. That is certainly reasonable, but it leads to the 
conservation of probability enshrined in Equation 1.27. What if we assign to V an 
imaginary part: 

V = V 0 - iT, 

where Vq is the true potential energy and T is a positive real constant? 

(a) Show that (in place of Equation 1.27) we now get 
dP = _2T 

dt n ‘ 

(b) Solve for P(t), and find the lifetime of the particle in terms of T. 


1.5 MOMENTUM 


For a particle in state 4*, the expectation value of x is 



[1.28] 


What exactly does this mean? It emphatically does not mean that if you measure the 
position of one particle over and over again, f x\^\ 2 dx is the average of the results 
you’ll get. On the contrary, the first measurement (whose outcome is indeterminate) 
will collapse the wave function to a spike at the value actually obtained, and the 
subsequent measurements (if they’re performed quickly) will simply repeat that same 
result. Rather, {x) is the average of measurements performed on particles all in the 
state which means that either you must find some way of returning the particle 
to its original state after each measurement, or else you prepare a whole ensemble of 
particles, each in the same state and measure the positions of all of them: (x) is the 
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average of these results. [I like to picture a row of bottles on a shelf, each containing 
a particle in the state 'P (relative to the center of the bottle). A graduate student with 
a ruler is assigned to each bottle, and at a signal they all measure the positions of 
their respective particles. We then construct a histogram of the results, which should 
match I'l'l 2 , and compute the average, which should agree with (x). (Of course, since 
we’re only using a finite sample, we can’t expect perfect agreement, but the more 
bottles we use, the closer we ought to come.)] In short, the expectation value is the 
average of repeated measurements on an ensemble of identically prepared systems, 
not the average of repeated measurements on one and the same system. 

Now, as time goes on, (x) will change (because of the time dependence of 'I'), 
and we might be interested in knowing how fast it moves. Referring to Equations 
1.25 and 1.28, we see that 9 


d(x) 

dt 


: f 2 dx = £- fx-f (V^ - dx. [1.29] 

J at 2m J ox \ dx dx ) 


This expression can be simplified using integration by parts 10 : 


d(x) 

dt 


m 

~2m 


/* / a^P* \ 

lr*rirV dx - 


[1.30] 


[I used the fact that dx/dx = 1, and threw away the boundary term, on the ground 
that *P goes to zero at (±) infinity.] Performing another integration by parts on the 
second term, we conclude that 


d(x) 

dt 


m 


/** 


dv , 

j^ dx - 


[i.31] 


What are we to make of this result? Note that we’re talking about the “velocity” 
of the expectation value of x, which is not the same thing as the velocity of the particle. 
Nothing we have seen so far would enable us to calculate the velocity of a particle— 
it’s not even clear what velocity means in quantum mechanics. If the particle doesn’t 
have a determinate position (prior to measurement), neither does it have a well-defined 
velocity. All we could reasonably ask for is the probability of getting a particular 
value. We’ll see in Chapter 3 how to construct the probability density for velocity, 


9 To keep things from getting too cluttered, I suppress the limits of integration when they are ±oc. 
10 The product rule says that 


d .,, r dg df 

Tx {fg) = f Tx + Tx g 


from which it follows that 

[ f % dx = ~[ % gdx + fg \l- 

Under the integral sign, then, you can peel a derivative off one factor in a product and slap it onto the other 
one—it’ll cost you a minus sign, and you’ll pick up a boundary term. 
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given ; for our present purposes it will suffice to postulate that the expectation value 
of the velocity is equal to the time derivative of the expectation value of position: 


{v) = 


d(x) 

dt 


[ 132 ] 


Equation 1.31 tells us, then, how to calculate { v) directly from vp. 

Actually, it is customary to work with momentum ( p — mv), rather than ve¬ 
locity: __ 

11.331 


Let me write the expressions for (x) and {p) in a more suggestive way: 


(x) = j dx. 


[1.34] 


[ 1 . 35 ] 


We say that the operator 11 x “represents” position, and the operator (ft/ i)(d/dx) 
“represents” momentum, in quantum mechanics; to calculate expectation values, we 
“sandwich” the appropriate operator between vp* and and integrate. 

That’s cute, but what about other dynamical variables? The fact is, all such 
quantities can be written in terms of position and momentum. Kinetic energy, for 
example, is 


1 ? 

T = -mv 2 
2 


P 

2m ’ 


and angular momentum is 

L — r x mv = r x p 

(the latter, of course, does not occur for motion in one dimension). To calculate the 
expectation value of such a quantity, we simply replace every p by (fif i)(d/dx), 
insert the resulting operator between vp* and *P, and integrate: 


{Q(x, p )> = 


f * ft 

= J V*Q(X, T 


ft d_ 
dx 


)vp dx. 


[1.36] 


11 An operator is an instruction to do something to the function that follows. The position operator 
tells you to multiply by x; the momentum operator tells you to differentiate with respect to x (and multiply 
the result by — ih). In this book all operators will be derivatives (djdt, d 2 jdt 2 , d 2 /dxdy,e tc.) or multipliers 
(2, i, x 2 , etc.) or combinations of these. 
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For example, 

-h 2 f a 2 '!' 

< r> = I 11371 

Equation 1.36 is a recipe for computing the expectation value of any dynamical 
quantity for a particle in state 'P; it subsumes Equations 1.34 and 1.35 as special 
cases. I have tried in this section to make Equation 1.36 seem plausible, given Bom’s 
statistical interpretation, but the truth is that this equation represents such a radically 
new way of doing business (as compared with classical mechanics) that it’s a good 
idea to get some practice using it before we come back (in Chapter 3) and put it on 
a firmer theoretical foundation. In the meantime, if you prefer to think of it as an 
axiom, that’s fine with me. 


Problem 1.11 Why can’t you do integration by parts directly on the middle ex¬ 
pression in Equation 1.29—pull the time derivative over onto x, note that dx/dt — 0, 
and conclude that d(x)/dt — 0? 


^Problem 1.12 Calculated {p)/dt. Answer: 


dip) _ dV 
dt { dx } 


[1.38] 


(This is known as Ehrenfest’s theorem; it tells us that expectation values obey 
Newton’s second law.) 


Problem 1.13 Suppose you add a constant Vq to the potential energy (by “constant” 
I mean independent of x as well as t). In classical mechanics this doesn’t change 
anything, but what about quantum mechanics? Show that the wave function picks 
up a time-dependent phase factor: exp(— iV^t/h). WTiat effect does this have on the 
expectation value of a dynamical variable? 


1.6 THE UNCERTAINTY PRINCIPLE 


Imagine that you’re holding one end of a very long rope, and you generate a wave by 
shaking it up and down rhythmically (Figure 1.6). If someone asked you, “Precisely 
where is that wave?” you’d probably think he was a little bit nutty: The wave isn’t 
precisely anywhere—it’s spread out over 50 feet or so. On the other hand, if he asked 
you what its wavelength is, you could give him a reasonable answer: It looks like 
about 6 feet. By contrast, if you gave the rope a sudden jerk (Figure 1.7), you’d get a 
relatively narrow bump traveling down the line. This time the first question (Where 
precisely is the wave?) is a sensible one, and the second (What is its wavelength?) 
seems nutty—it isn’t even vaguely periodic, so how can you assign a wavelength to it? 
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Figure 1.6: A wave with a (fairly) well-defined wavelength but an ill-defined 
position. 

Of course, you can draw intermediate cases, in which the wave is fairly well localized 
and the wavelength is fairly well defined, but there is an inescapable trade-off here: 
The more precise a wave’s position is, the less precise is its wavelength, and vice 
versa. 12 A theorem in Fourier analysis makes all this rigorous, but for the moment I 
am only concerned with the qualitative argument. 

This applies, of course, to any wave phenomenon, and hence in particular to 
the quantum mechanical wave function. Now the wavelength of T* is related to the 
momentum of the particle by the de Broglie formula 13 : 


P 


h 

X 


2ith 


[1.39] 


Thus a spread in wavelength corresponds to a spread in momentum , and our general 
observation now says that the more precisely determined a particle’s position is, the 
less precisely its momentum is determined. Quantitatively, 


o x o p > 


h 

2 ’ 


[1.40] 


where o x is the standard deviation in x, and o p is the standard deviation in p. This 
is Heisenberg’s famous uncertainty principle. (We’ll prove it in Chapter 3, but I 
wanted to mention it here so you can test it out on the examples in Chapter 2.) 



x(feet) 


Figure 1.7: A wave with a (fairly) well-defined position but an ill-defined wave¬ 
length^_ 

12 That’s why a piccolo player must be right on pitch, whereas a double-bass player can afford to 
wear garden gloves. For the piccolo, a sixty-fourth note contains many full cycles, and the frequency (we’re 
working in the time domain now, instead of space) is well defined, whereas for the bass, at a much lower 
register, the sixty-fourth note contains only a few cycles, and all you hear is a general sort of “oomph,” 
with no very clear pitch. 

13 ril prove this in due course. Many authors take the de Broglie formula as an axiom, from 
which they then deduce the association of momentum with the operator (h/i)(d/dx). Although this is a 
conceptually cleaner approach, it involves diverting mathematical complications that I would rather save 
for later. 
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Please understand what the uncertainty principle means: Like position mea¬ 
surements, momentum measurements yield precise answers—the “spread” here refers 
to the fact that measurements on identical systems do not yield consistent results. You 
can, if you want, prepare a system such that repeated position measurements will be 
very close together (by making 'P a localized “spike”), but you will pay a price: Mo¬ 
mentum measurements on this state will be widely scattered. Or you can prepare a 
system with a reproducible momentum (by making 'I' a long sinusoidal wave), but in 
that case position measurements will be widely scattered. And, of course, if you’re in 
a really bad mood you can prepare a system in which neither position nor momentum 
is well defined: Equation 1.40 is an inequality, and there’s no limit on how big o x and 
<y p can be—just make some long wiggly line with lots of bumps and potholes and 
no periodic structure. 


^Problem 1.14 A particle of mass m is in the state 


'I'(jM) = Ae- a{{mxllh) ^ t] , 


where A and a are positive real constants. 

(a) Find A. 

(b) For what potential energy function V (x) does satisfy the Schrodinger equa¬ 
tion? 

(c) Calculate the expectation values of x,x 2 , p, and p 2 . 

(d) Find o x and o p . Is their product consistent with the uncertainty principle? 



CHAPTER 2 


THE TIME-INDEPENDENT 
SCHRODINGER EQUATION 


2.1 STATIONARY STATES _ 

In Chapter 1 we talked a lot about the wave function and how you use it to calculate 
various quantities of interest. The time has come to stop procrastinating and confront 
what is, logically, the prior question: How do you get 'T (x, t) in the first place—how 
do you go about solving the Schrodinger equation? I shall assume for all of this 
chapter (and most of this book) that the potential, 1 V, is independent oft. In that case 
the Schrodinger equation can be solved by the method of separation of variables 
(the physicist’s first line of attack on any partial differential equation): We look for 
solutions that are simple products, 

^(x,0 = \j/(x) f{t), [2.1] 

where \(/ (lowercase) is a function of x alone, and / is a function of t alone. On its 
face, this is an absurd restriction, and we cannot hope to get more than a tiny subset 
of all solutions in this way. But hang on, because the solutions we do obtain turn out 
to be of great interest. Moreover, as is typically the case with separation of variables, 
we will be able at the end to patch together the separable solutions in such a way as 
to construct the most general solution. 


1 It is tiresome to keep saying “potential energy function,” so most people just call V the “potential”, 
even though this invites occasional confusion with electric potential, which is actually potential energy 
per unit charge. 




Sec. 2.1: Stationary States 21 


For separable solutions we have 

9^ _ df d 2 ^ _ d 2 f 
dt ^ dt' dx 2 dx 2 

(i ordinary derivatives, now), and the Schrodinger equation (Equation 1.1) reads 


.df 

dt 


h 2 d 2 \Is 

- f+ V\Uf. 

2m dx 2 


Or, dividing through by xj/f: 


f dt 2m f dx 2 


[ 2 . 2 ] 


Now the left side is a function of t alone, and the right side is a function of x alone. 2 
The only way this can possibly be true is if both sides are in fact constant —otherwise, 
by varying t, I could change the left side without touching the right side, and the two 
would no longer be equal. (That’s a subtle but crucial argument, so if it’s new to you, 
be sure to pause and think it through.) For reasons that will appear in a moment, we 
shall call the separation constant E. Then 


.. 1 df 
in — —- = E, 
f dt 


or 


and 


or 


df _ 
dt Ti 


-El d 2± + V 

2m \j/ dx 2 


E, 



[2.3] 


[2.4] 


Separation of variables has turned a partial differential equation into two ordi¬ 
nary differential equations (Equations 2.3 and 2.4). The first of these is easy to solve 
(just multiply through by dt and integrate); the general solution is C exp(-iEtfh), 
but we might as well absorb the constant C into f (since the quantity of interest is 
the product \j/f). Then 

m = e- IE ‘ /t . [2.5] 


The second (Equation 2.4) is called the time-independent Schrodinger equation; 
we can go no further with it until the potential V (x) is specified. 


X. 


2 Note that this would not be true if V were a function of t as well 
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The rest of this chapter will be devoted to solving the time-independent Schro¬ 
dinger equation, for a variety of simple potentials. But before we get to that I would 
like to consider further the question: What ’.v so great about separable solutions ? After 
all, most solutions to the (time-dependent) Schrodinger equation do not take the form 
fix) f(t). I offer three answers—two of them physical and one mathematical: 

1. They are stationary states. Although the wave function itself, 


^(x, t) = f(x)e~ iEt/h , 


[ 2 . 6 ] 


does (obviously) depend on t, the probability density 

| V(x, t)\ 2 = 4'*'!' = f*e +iE ‘ /h fe~ iEt/h = \f{x)\ 2 [2.7] 

does not —the time dependence cancels out. 3 The same thing happens in calculating 
the expectation value of any dynamical variable; Equation 1.36 reduces to 

(Q(x, p)) = J f*Q(x,j^-)fdx. [2.8] 

Every expectation value is constant in time; we might as well drop the factor /( t ) 
altogether, and simply use f in place of 'I'. (Indeed, it is common to refer to f 
as “the wave function”, but this is sloppy language that can be dangerous, and it is 
important to remember that the true wave function always carries that exponential 
time-dependent factor.) In particular, (x) is constant, and hence (Equation 1.33) 
{p) = 0. Nothing ever happens in a stationary state. 

2. They are states of definite total energy. In classical mechanics, the total 
energy (kinetic plus potential) is called the Hamiltonian: 

p 2 

H(x,p) = ^- + V(x). [2.9] 

2m 

The corresponding Hamiltonian operator , obtained by the canonical substitution p 
(Ti/i)(d/ 3x), is therefore 4 


* Ti 2 d 2 

H = -*nr> + r ™- 


[ 2 . 10 ] 


Thus the time-independent Schrodinger equation (Equation 2.4) can be written 


Hf = Ef, 


[ 2 . 11 ] 


3 For normalizable solutions, E must be real (see Problem 2.1a). 

4 Whenever confusion might arise, I’ll put a “hat” (' ) on the operator to distinguish it from the 
dynamical variable it represents. 
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and the expectation value of the total energy is 

{H) ^ j f*Hfdx = E j lisfdx = E. [2.12] 

(Note that the normalization of entails the normalization of Moreover, 

H 2 f = H{Hf) = H(Ef) = E(Hijf) = E 2 f, 

and hence 

(H 2 ) = j \j/*H 2 \jf dx = E 2 j \f\ 2 dx = E 2 . 

So the standard deviation in H is given by 

a 2 = (H 2 ) - ( H } 2 = E 2 -E 2 = 0. [2.13] 

But remember, if a = 0, then every member of the sample must share the same value 
(the distribution has zero spread). Conclusion: A separable solution has the property 
that every measurement of the total energy is certain to return the value E. (That’s 
why I chose that letter for the separation constant.) 

3. The general solution is a linear combination of separable solutions. As 
we’re about to discover, the time-independent Schrodinger equation (Equation 2.4) 
yields an infinite collection of solutions (i/fi(x), \fr 2 (x), fi(x), ...), each with its 
associated value of the separation constant (E\, E 2 , E 3 , ...); thus there is a different 
wave function for each allowed energy: 

^i(x, t) = Mx)e~ iEll/ \ ty 2 (.x, t) = f 2 (x)e - iElt/h ,.... 

Now (as you can easily check for yourself) the (time-flfependent) Schrodinger equation 
(Equation 1.1) has the property that any linear combination 5 of solutions is itself a 
solution. Once we have found the separable solutions, then, we can immediately 
construct a much more general solution, of the form 

^(x, t) = 'Y^c n \l/n(x)e~ ,Ent/h . [2.14] 

n =1 

It so happens that every solution to the (time-dependent) Schrodinger equation can be 
written in this form—it is simply a matter of finding the right constants (ci, c 2 , ...) 
so as to fit the initial conditions for the problem at hand. You’ll see in the following 
sections how all this works out in practice, and in Chapter 3 we’ll put it into more 
elegant language, but the main point is this: Once you’ve solved the time-independent 


5 A linear combination of the functions f\ (z), f 2 {z), ... is an expression of the form 
/(z) = ci/iO)+ c 2 / 2 (z) + 


where c \, c 2 ,... are any (complex) constants. 
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Schrodinger equation, you’re essentially done; getting from there to the general so¬ 
lution of the time-Gfependent Schrodinger equation is simple and straightforward. 

* Problem 2.1 Prove the following theorems: 


(a) For normalizable solutions, the separation constant E must be real. Hint: Write 
E (in Equation 2.6) as £ 0 + IT (with E 0 and F real), and show that if Equation 
1.20 is to hold for all t, F must be zero. 

(b) f can always be taken to be real (unlike 'I', which is necessarily complex). 
Note : This doesn’t mean that every solution to the time-independent Schrodinger 
equation is real; what it says is that if you’ve got one that is not, it can always be 
expressed as a linear combination of solutions (with the same energy) that are. 
So in Equation 2.14 you might as well stick to ^’s that are real. Hint: If i/(x) 
satisfies the time-independent Schrodinger equation for a given E, so too does 
its complex conjugate, and hence also the real linear combinations (if + ir*) 
and i(if — if*). 

(c) If V(x) is an even function [i.e., V(-x) = V(x)], then f(x) can always be 
taken to be either even or odd. Hint: If f(x) satisfies the time-independent 
Schrodinger equation for a given E, so too does \jf(—x ), and hence also the 
even and odd linear combinations if(x) ± if(—.x). 

^Problem 2.2 Show that E must exceed the minimum value of V(x) for every 
normalizable solution to the time-independent Schrodinger equation. What is the 
classical analog to this statement? Hint: Rewrite Equation 2.4 in the form 


d 2 \(f 


— [V(x)-E]ifr, 

n 


if E < F m in, then f and its second derivative always have the same sign —argue that 
such a function cannot be normalized. 


2.2 THE INFINITE SQUARE WELL 


Suppose 


0, if 0 < x < a, 
oo, otherwise 


[2.15] 


(Figure 2.1). A particle in this potential is completely free, except at the two ends 
(x = 0 and jc = a), where an infinite force prevents it from escaping. A classical 
model would be a cart on a frictionless horizontal air track, with perfectly elastic 
bumpers—it just keeps bouncing back and forth forever. (This potential is awfully 
artificial, but I urge you to treat it with respect. Despite its simplicity—or rather, 
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Figure 2.1: The infinite square well 
potential (Equation 2.15). 


precisely because of its simplicity—it serves as a wonderfully accessible test case for 
all the fancy stuff that comes later. We’ll refer back to it frequently.) 

Outside the well, f (x) =0 (the probability of finding the particle there is zero). 
Inside the well, where V = 0, the time-independent Schrodinger equation (Equation 
2.4) reads 


h 2 d 2 f 
2m dx 2 


= Ef, 


[2.16] 


d 2 f 


= —k 2 if, where k = - 


[2.17] 


dx 2 r ’ ’’ h 

(By writing it in this way, I have tacitly assumed that E > 0; we know from Problem 
2.2 that E < 0 doesn’t work.) Equation 2.17 is the (classical) simple harmonic 
oscillator equation; the general solution is 


fix) — A sin£x + Bcoskx, 


[2.18] 


where A and B are arbitrary constants. Typically, these constants are fixed by the 
boundary conditions of the problem. What are the appropriate boundary conditions 
for i (r(x)l Ordinarily, both f and dif jdx are continuous, but where the potential 
goes to infinity only the first of these applies. (I’ll prove these boundary conditions, 
and account for the exception when V — oo, later on; for now I hope you will trust 
me.) 

Continuity of f (x) requires that 


MO) = M“) = 0, 


[2.19] 


so as to join onto the solution outside the well. What does this tell us about A and 
B1 Well, 

\j/( 0) — A sinO + 5cos0 = B, 


so B = 0, and hence 


fix) = A sin&x. 


[ 2 . 20 ] 


Then f(a) = A sin ka, so either A — 0 [in which case we’re left with the trivial— 
nonnormalizable—solution \j/(x) — 0], or else sin ka = 0, which means that 


ka = 0, ±7r, ±2jr, ±37r, 


[2.21] 
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But k = 0 is no good [again, that would imply ifr(x) = 0], and the negative solutions 
give nothing new, since sin(-0) = - sin(0) and we can absorb the minus sign into 
A. So the distinct solutions are 

k n = —, withw = 1, 2, 3, — [2.22] 

a 

Curiously, the boundary condition at x = a does not determine the constant A , 
but rather the constant k, and hence the possible values of E : 


h 2 k 2 n 

n 2 7t 2 Ti 2 

tn - 2m 

2m a 2 


[2.23] 


In sharp contrast to the classical case, a quantum particle in the infinite square well 
cannot have just any old energy—only these special allowed values. Well, how do 
we fix the constant A ? Answer: We normalize iff : 


f \A\ 2 sir?{kx)dx — \A\ 2 - = 1, so \A\ 2 =-. 

Jo la 

This only determines the magnitude of A, but it is simplest to pick the positive real 
root: A = *j2[a (the phase of A carries no physical significance anyway). Inside the 
well, then, the solutions are 



[2.24] 


As promised, the time-independent Schrodinger equation has delivered an infi¬ 
nite set of solutions, one for each integer n. The first few of these are plotted in Fig¬ 
ure 2.2; they look just like the standing waves on a string of length a. t/q, which car¬ 
ries the lowest energy, is called the ground state; the others, whose energies increase 
in proportion to n 2 , are called excited states. As a group, the functions f n (x) have 
some interesting and important properties: 

1. They are alternately even and odd, with respect to the center of the well. 
(iff i is even, i/q is odd, i/q is even, and so on. 6 ) 

2. As you go up in energy, each successive state has one more node (zero 
crossing). iff\ has none (the end points don’t count), fa has one, fa has two, and so 
on. 


6 To make this symmetry more apparent, some authors center the well at the origin (so that it runs 
from —a/2 to -fa/2. The even functions are then cosines, and the odd ones are sines. See Problem 2.4. 
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Vi(*) 


y 2 (x) 


V 3 (*) 






Figure 2.2: The first three stationary states of the infinite square well (Equation 
2.24). 


3. They are mutually orthogonal, in the sense that 

j i j/ m (x)*\{r n (x)dx = 0, 

whenever m ^ n. Proof 

J \Jr m (x)*\Jf n (x)dx = - j sin {^—x^j sin x^ dx 

= \ l b O'rM - cos (^*)] dx 

\ 1 . (m-n \ 1 . f m+n \ 1 i« 

= \ - -— sm - ttx — - -— sin ——tzx 1 

[(m—n)n \ a ) ( m+n)n \ a ) J lo 

_ 1 | sin[(m — n)jr] sin[(m + n)n 1 _ q 

n j (m-n) (m+n) J 


[2.25] 


Note that this argument does not work if m = n (can you spot the point at which 
it fails?); in that case normalization tells us that the integral is 1. In fact, we can 
combine orthogonality and normalization into a single statement 7 : 


/■ 


\j/ m (x)*if n (x)dx = 8 mn , [2.26] 

where 8 mn (the so-called Kronecker delta) is defined in the usual way, 

0, if m ^ n: 

' ^ [2.27 

1, if m =n. 

We say that the i/r’s are orthonormal. 

4. They are complete, in the sense that any other function, /(x), can be ex¬ 
pressed as a linear combination of them: 

oo p2 00 

f(x) = ^c n f n (x) = y- ^c n sin (~x^- [2.28] 


7 In this case the rfr 's are real, so the * on \j/ m is unnecessary, but for future purposes it’s a good 
idea to get in the habit of putting it there. 



28 


Chap. 2 The Time-Independent Schrodinger Equation 


I’m not about to prove the completeness of the functions *J2fa sm{rmx/a), but if 
you’ve studied advanced calculus you will recognize that Equation 2.28 is nothing 
but the Fourier series for /(*), and the fact that “any” function can be expanded 
in this way is sometimes called Dirichlet’s theorem. 8 The expansion coefficients 
(i c n ) can be evaluated—for a given f(x )—by a method I call Fourier’s trick, which 
beautifully exploits the orthonormality of {^/ n }: Multiply both sides of Equation 2.28 
by ^m(x)*, and integrate. 

/ OO p 00 

f m (x)*f(x)dx = J2 C " MxTi/ n (x)dx = CJmn = c m . [2.29] 

n =1 J n =1 

(Notice how the Kronecker delta kills every term in the sum except the one for which 
n — ra.) Thus the rath coefficient in the expansion of f(x) is given by 

c m = j if m (x)* f(x) dx. [2.30] 

These four properties are extremely powerful, and they are not peculiar to the 
infinite square well. The first is true whenever the potential itself is an even function; 
the second is universal, regardless of the shape of the potential. 9 Orthogonality is also 
quite general—I’ll show you the proof in Chapter 3. Completeness holds for all the 
potentials you are likely to encounter, but the proofs tend to be nasty and laborious; 
I’m afraid most physicists simply assume completeness and hope for the best. 

The stationary states (Equation 2.6) for the infinite square well are evidently 

¥„(*, 0 = -y/|sin (™ x) [2.31] 

I claimed (Equation 2.14) that the most general solution to the (time-dependent) 
Schrodinger equation is a linear combination of stationary states: 

»(*, t) = sin {™ x ) [2.32] 

If you doubt that this is a solution, by all means check it! It remains only for me to 
demonstrate that I can fit any prescribed initial wave function, 4* (x, 0), by appropriate 
choice of the coefficients c n . According to Equation 2.32, 

OC 

4/( X) 0) = Y.Cnfnix). 


8 See, for example, Mary Boas, Mathematical Methods in the Physical Sciences, 2nd ed. (New 
York: John Wiley & Sons, 1983), p. 313; fix) can even have a finite number of finite discontinuities. 

9 See, for example, John L. Powell and Bemd Crasemann, Quantum Mechanics (Reading, MA: 
Addison-Wesley, 1961), p. 126. 
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The completeness of the xf ’s (confirmed in this case by Dirichlet’s theorem) guarantees 
that I can always express 4>(x, 0) in this way, and their orthonormality licenses the 
use of Fourier’s trick to determine the actual coefficients: 

-/If sin(^-x) ^{x,0)dx. I (2.33] 

That does it: Given the initial wave function, vF(x, 0), we first compute the 
expansion coefficients c n , using Equation 2.33, and then plug these into Equation 2.32 
to obtain (x, t ). Armed with the wave function, we are in a position to compute any 
dynamical quantities of interest, using the procedures in Chapter 1. And this same 
ritual applies to any potential—the only things that change are the functional form of 
the if ’s and the equation for the allowed energies. 

Problem 2.3 Show that there is no acceptable solution to the (time-independent) 
Schrodinger equation (for the infinite square well) with E = 0 or E <0. (This is a 
special case of the general theorem in Problem 2.2, but this time do it by explicitly 
solving the Schrodinger equation and showing that you cannot meet the boundary 
conditions.) 

Problem 2.4 Solve the time-independent Schrodinger equation with appropriate 
boundary conditions for an infinite square well centered at the origin [V(x) = 0, for 
—a/2 < x < +a/2; V(x) = oo otherwise]. Check that your allowed energies are 
consistent with mine (Equation 2.23), and confirm that your i/r’s can be obtained from 
mine (Equation 2.24) by the substitution x -» x — a/2. 

♦Problem 2.5 Calculate (x), (x 2 ), ( p), (p 2 ), o x , and o p , for the «th stationary state 
of the infinite square well. Check that the uncertainty principle is satisfied. Which 
state comes closest to the uncertainty limit? 

♦♦Problem 2.6 A particle in the infinite square well has as its initial wave function 
an even mixture of the first two stationary states: 

^(x, 0) = A[\lr x {x) + foix)]. 

(a) Normalize vF(x, 0). (That is, find A. This is very easy if you exploit the 
orthonormality of \j/ x and f 2 . Recall that, having normalized at t = 0, you 
can rest assured that it stays normalized—if you doubt this, check it explicitly 
after doing part b.) 

(b) Find 4>(x, t) and ^(x, t)\ 2 . (Express the latter in terms of sinusoidal functions 
of time, eliminating the exponentials with the help of Euler’s formula: e l6 — 
cos 9 + i sin 9.) Let a> = n 2 fi/2ma 2 . 

(c) Compute (x). Notice that it oscillates in time. What is the frequency of the 
oscillation? What is the amplitude of the oscillation? (If your amplitude is 
greater than a/2, go directly to jail.) 
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(d) Compute { p) . (As Peter Lorre would say, “Do it ze kveek vay, Johnny!”) 

(e) Find the expectation value of H. How does it compare with E\ and £ 2 ? 

(f) A classical particle in this well would bounce back and forth between the walls. 
If its energy is equal to the expectation value you found in (e), what is the 
frequency of the classical motion? How does it compare with the quantum 
frequency you found in (c)? 

Problem 2.7 Although the overall phase constant of the wave function is of no 
physical significance (it cancels out whenever you calculate a measureable quantity), 
the relative phase of the expansion coefficients in Equation 2.14 does matter. For 
example, suppose we change the relative phase of ^1 and 1 fa in Problem 2.6: 

V(x,0) = A[t i (x) + e‘U 2 (x)l 

where 4> is some constant. Findfl'tx, t), ^(x, 0l 2 >and <x), and compare your results 
with what you got before. Study the special cases <f> = n/2 and <f> = n. 

❖Problem 2.8 A particle in the infinite square well has the initial wave function 

VP(x, 0) = Ax(a — x ). 

(a) Normalize ^(x,0). Graph it. Which stationary state does it most closely 
resemble? On that basis, estimate the expectation value of the energy. 

(b) Compute (x), (p ), and (H), at t = 0. {Note: This time you cannot get (p) by 
differentiating (x), because you only know (x) at one instant of time.) How 
does ( H) compare with your estimate in (a)? 

❖Problem 2.9 Find 4>(x, 0 for the initial wave function in Problem 2.8. Evaluate 
ci, C 2 , and C 3 numerically, to five decimal places, and comment on these numbers. 
(c„ tells you, roughly speaking, how much \[f n is “contained in” 4>.) Suppose you 
measured the energy at time to > 0, and got the value E 3 . Knowing that immediate 
repetition of the measurement must return the same value, what can you say about 
the coefficients c„ after the measurement? (This is an example of the “collapse of the 
wave function”, which we discussed briefly in Chapter 1.) 

❖Problem 2.10 The wave function (Equation 2.14) has got to be normalized; given 
that the V'Vi’s are orthonormal, what does this tell you about the coefficients c„? 
Answer: 

OQ 

X>| 2 = 1. [2-34] 

W = 1 

(In particular, \c n \ 2 is always < 1.) Show that 

00 

(H) = Y,En\c„\\ 


[2.35] 
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Incidentally, it follows that (H) is constant in time, which is one manifestation of 
conservation of energy in quantum mechanics. 


2.3 THE HARMONIC OSCILLATOR _ 

The paradigm for a classical harmonic oscillator is a mass m attached to a spring of 
force constant k. The motion is governed by Hooke’s law, 

T7 b dljc 

F = -lex = m —r 

dt 2 

(as always, we ignore friction), and the solution is 

x(t) = A sin {(Dt) + Bcos(cot), 

where 



is the (angular) frequency of oscillation. The potential energy is 

V(x) = jkx 2 - [2.37] 

its graph is a parabola. 

Of course, there’s no such thing as a perfect simple harmonic oscillator—if you 
stretch it too far the spring is going to break, and typically Hooke’s law fails long 
before that point is reached. But practically any potential is approximately parabolic, 
in the neighborhood of a local minimum (Figure 2.3). Formally, if we expand V (x) 
in a Taylor series about the minimum: 

V(x) = V(x 0 ) + V'<,x 0 )(x ~xo)+\ V"(x 0 )(x - xo) 2 + ■ ■ ■, 

subtract V (x 0 ) [you can add a constant to V(x) with impunity, since that doesn’t 
change the force], recognize that V'(xq) = 0 (since jco is a minimum), and drop the 
higher-order terms [which are negligible as long as (x —xq) stays small], the potential 
becomes 

V(x) = V"(x 0 )(x - xo) 2 , 

which describes simple harmonic oscillation (about the point xo), with an effective 
spring constant k = F"(xo ). 10 That’s why the simple harmonic oscillator is so 
important: Virtually any oscillatory motion is approximately simple harmonic, as 
long as the amplitude is small. 


10 Note that V " (xq ) > 0, since by assumption X[ } is a minimum. Only in the rare case V"(xu) = 0 
is the oscillation not even approximately simple harmonic. 
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Figure 2.3: Parabolic approximation (dashed curve) to an arbitrary potential, in 
the neighborhood of a local minimum. 

The quantum problem is to solve the Schrodinger equation for the potential 

V (x) = ^mo/x 2 [2.38] 

(it is customary to eliminate the spring constant in favor of the classical frequency, 
using Equation 2.36). As we have seen, it suffices to solve the time-independent 
Schrodinger equation: 

- T - ~TT + = Ex l f - [2.39] 

2m dx l 2 

In the literature you will find two entirely different approaches to this problem. The 
first is a straighforward “brute force” solution to the differential equation, using the 
method of power series expansion; it has the virtue that the same strategy can be 
applied to many other potentials (in fact, we’ll use it in Chapter 4 to treat the Coulomb 
potential). The second is a diabolically clever algebraic technique, using so-called 
ladder operators. I’ll show you the algebraic method first, because it is quicker and 
simpler (and more fun); if you want to skip the analytic method for now, that’s fine, 
but you should certainly plan to study it at some stage. 

2.3.1 Algebraic Method 

To begin with, let’s rewrite Equation 2.39 in a more suggestive form: 

2- (-4~) +(m<ox) 2 f = Ef. [2.40] 

2m \i dx ) 

The idea is to factor the term in square brackets. If these were numbers , it would be 
easy: 

u 2 + v 2 = (u — iv)(u + iv ). 
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Here, however, it’s not quite so simple, because u and v are operators, and operators 
do not, in general, commute (uv is not the same as vu). Still, this does invite us to 
take a look at the expressions 


a± = —== (— —j— i imo)x \ . [2.41] 

\j2m \i dx / 

What is their product, a _£*+? Warning: Operators can be slippery to work with 
in the abstract, and you are bound to make mistakes unless you give them a “test 
function”, f(x), to act on. At the end you can throw away the test function, and 
you’ll be left with an equation involving the operators alone. In the present case, we 
have 


(a-a + )f(x) = ± -<»<*) +'»<“) /« 

±(*f- lmaa ) ( n - d -L + imalxf ) 

2m \ i dx J \ i dx ) 


2m 

, d 

dO +nmw Tx ( 


= — —{xf) — hmcox-f. -j- (moj^) 2 /] 

2m [ dx 2 dx dx J 


2m \i dx ) 


— ] + (mcox) 2 + ftma) /(x). 




[I used d(x/)/dx = x(df/dx) + / in the last step.] Discarding the test function, we 
conclude that 


2m 


{--) 

\i dx J 


+ (mcox) 


+ -boo. [2.42] 


Evidently Equation 2.40 does not factor perfectly—there’s an extra term (\/2)hco. 
However, if we pull this over to the other side, the Schrodinger equation 11 becomes 


(a-a + — = E\J/. 


[2.43] 


Notice that the ordering of the factors a + and is important here; the same 
argument, with a + on the left, yields 


m 


Thus 


t —— ) + (mcox) 


a-a + — a + a_ = hco. 


- tuo. 

2 


[2.44] 


[2.45] 


11 I’m getting tired of writing “time-independent Schrodinger equation,” so when it’s clear from the 
context which one I mean, I’ll just call it the Schrodinger equation. 
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and the Schrodinger equation can also be written 

(a + a~ + -hco)\(f = E\jr. [2.46] 

Now, here comes the crucial step: I claim that if satisfies the Schrodinger 
equation, with energy E, then a + fi satisfies the Schrodinger equation with energy 
(E +haj). Proof: 


(a + a_ + -h(o)(a + fi) = ( a+a_a + + -hcoa + ) f 

— a + (a_a + + -hco)\}r = a + [(a_a + — -hco)\j/ + Ticofi] 

= a + (E\js = (E + h(jo){a + fi). QED 

[Notice that whereas the ordering of a + and a_ does matter, the ordering of a± and 
any constants (such as ft, co, and E) does not.] By the same token, a^fi is a solution 
with energy (E — fuo): 

(a_a + — ^ hco)(a-fi) = a_(a + a~ — -hco)fi 

= fl_[(a + o_ + — hcofi] — a~(Efi — ha) fi) 

= (E — hco)(a_if/). 

Here, then, is a wonderful machine for grinding out new solutions, with higher and 
lower energies—if we can just find one solution, to get started! We call a± ladder 
operators, because they allow us to climb up and down in energy; a + is called the 
raising operator, and a- the lowering operator. The “ladder” of states is illustrated 
in Figure 2.4. 

But wait! What if I apply the lowering operator repeatedly? Eventually I’m 
going to reach a state with energy less than zero, which (according to the general 
theorem in Problem 2.2) does not exist! At some point the machine must fail. How 
can that happen? We know that a-fi is a new solution to the Schrodinger equation, 
but there is no guarantee that it will be normalizable —it might be zero, or its square 
integral might be infinite. Problem 2.11 rules out the latter possibility. Conclusion: 
There must occur a “lowest rung” (let’s call it fif) such that 

a-fi o = 0. [2.47] 


1 /ft dfip 

sfhn \i dx 


— imojx 


to 


= 0, 


That is to say, 
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Figure 2.4: The ladder of stationary states 
for the simple harmonic oscillator. 


or 


d\jf o 
dx 


mco 
■■ — 
ft 


This differential equation for i/r 0 is easy to solve: 


/ dfo mco f , 

—— = —— / xdx =>■ In 

fo ft J 


mco ? 

tyo — —-^-x + constant, 


so 

fo(x) = A oe~^ 


[2.48] 


To determine the energy of this state, we plug it into the Schrodinger equation (in 
the form of Equation 2.46), ( a + a _ + (l/2)hco)\j/o = fi'oV'o, and exploit the fact that 
a-\j/o — 0. Evidently 


E 0 = 


-ft CO. 
2 


[2.49] 


With our foot now securely planted on the bottom rung 12 (the ground state of 
the quantum oscillator), we simply apply the raising operator to generate the excited 
states 13 : 


f n (x) = A n (a+) n e * x \ 


with E n = {n + -)ftco. 


[2.50] 


l2 Note that there can only be one ladder, because the lowest state is uniquely determined by Equation 
2.47. Thus we have in fact obtained all the (normalizable) solutions. 

13 In the case of the harmonic oscillator, it is convenient to depart from our usual custom and number 
the states starting with n = 0 instead of n — 1. Obviously, the lower limit on the sum in equations such 
as Equation 2.14 should be altered accordingly. 
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(This method does not immediately determine the normalization factor A„; I’ll let 
you work that out for yourself in Problem 2.12.) For example, 

_ mo) J1 1 / fl d \ mu> 2 

— A[a + e M — .4 1 ,_ (- \-imcox\e & 

V2m \i dx ) 

Ai \h ( mco \ _ mix i . -m x 2~\ 

= __ - I ——x I e » + imeoxe » , 

V2 m L i\ ft J J 

which simplifies to 

if\{x) = (iA\a)V2m)xe~^ : ' x2 . [2.51] 

I wouldn’t want to calculate i/r 50 in this way, but never mind: We have found all the 
allowed energies, and in principle we have determined the stationary states—the rest 
is just computation. 

Problem 2.11 Show that the lowering operator cannot generate a state of infinite 
norm (i.e., f \a-ilr\ 2 dx < oo, if xf/ itself is a normalized solution to the Schrodinger 
equation). What does this tell you in the case \Jr = i/r 0 ? Hint: Use integration by 
parts to show that 


/: 


/' 


dx = I \lr*(a + a-\fr)dx. 


Then invoke the Schrodinger equation (Equation 2.46) to obtain 


r 


\a-\j/\ 2 dx = E — -fio), 


where E is the energy of the state \jr. 


* Problem 2.12 


(a) The raising and lowering operators generate new solutions to the Schrodinger 
equation, but these new solutions are not correctly normalized. Thus a + \jf n 
is proportional to \f/ n+ 1 , and a^ n is proportional to \fr„-u but we’d like to 
know the precise proportionality constants. Use integration by parts and the 
Schrodinger equation (Equations 2.43 and 2.46) to show that 

/ OO pOO 

\a + xlr n \ 2 dx = (n + l)heo, I \a-ty n \ 2 dx = nhw, 

-oo J —OO 

and hence (with f s to keep the wavefunctions real) 

a+fn = iy/(n + l)h( 0 \fr n+u [2.52] 

a-\jf n = —i*Jnh(o\{f, 


[2.53] 
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(b) Use Equation 2.52 to determine the normalization constant A n in Equation 2.50. 
(You’ll have to normalize \fr 0 “by hand”.) Answer : 


/ mcp\ 1/4 (-if 
\ 7 tTi ) *Jn\(fi(oy 


[2.54] 


❖Problem 2.13 Using the methods and results of this section, 

(a) Normalize fi (Equation 2.51) by direct integration. Check your answer against 
the general formula (Equation 2.54). 

(b) Find ^ 2 , but don’t bother to normalize it. 

(c) Sketch ifro, i/q, and i/r 2 . 

(d) Check the orthogonality of tyo, and \jr 2 - Note: If you exploit the evenness 
and oddness of the functions, there is really only one integral left to evaluate 
explicitly. 


❖Problem 2.14 Using the results of Problems 2.12 and 2.13, 

(a) Compute (x), {p), <x 2 ), and (p 2 ), for the states i/'o and f\. Note : In this and 
most problems involving the harmonic oscillator, it simplifies the notation if 
you introduce the variable £ = *Jmco/h x and the constant a = (moj/it'h) ] ' /4 . 

(b) Check the uncertainty principle for these states. 

(c) Compute ( T) and (V) for these states (no new integration allowed!). Is their 
sum what you would expect? 


2.3.2 Analytic Method 


We return now to the Schrodinger equation for the harmonic oscillator (Equa¬ 
tion 2.39): 

h 1 d 2 f 1 2 2, r . 

-— + -ma) l x l f = Ef. 

2m dx z 2 

Things look a little cleaner if we introduce the dimensionless variable 

i.yf,; B.J51 


in terms of the Schrodinger equation reads 
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where K is the energy, in units of (1 f2)ha): 


K = ■ 


2E 

ha) 


[2.57] 


Our problem is to solve Equation 2.56, and in the process obtain the “allowed” values 
of K (and hence of E). 

To begin with, note that at very large £ (which is to say, at very large x), £ 2 
completely dominates over the constant K , so in this regime 


d 2 ifr 

w 


fV. 


[ 2 . 58 ] 


which has the approximate solution (check it!) 


W) * Ae-?/ 1 + Be + ^ 2 . 


[2.59] 


The B term is clearly not normalizable (it blows up as |x| -> oo); the physically 
acceptable solutions, then, have the asymptotic form 

at large f. [2.60] 

This suggests that we “peel off’ the exponential part, 

*($) = [ 2 . 61 ] 


in hopes that what remains [A(£)] has a simpler functional form than f(£) itself. 14 
Differentiating Equation 2.61, we have 


d ± = ( d JL_ l :h\ e -?r. 

dH \dl ) 


and 


so the Schrodinger equation (Equation 2.56) becomes 


d 2 h 

w 


-2^ + (K-\)h=0. 


[2.62] 


I propose to look for a solution to Equation 2.62 in the form of a power series 

in £ 15 : 

00 

HH) = no + flit + flit 2 + • • ■ = X) «/?•''• [2.63] 

J -o 


14 Note that although we invoked some approximations to motivate Equation 2.61, what follows is 
exact. The device of stripping off the asymptotic behavior is the standard first step in the power series 
method for solving differential equations—see, for example, Boas (cited in footnote 8), Chapter 12. 

15 According to Taylor’s theorem, any reasonably well-behaved function can be expressed as a power 
series, so Equation 2.63 involves no real loss of generality. For conditions on the applicability of the series 
method, see Boas (cited in footnote 8) or George Arfken, Mathematical Methods for Physicists, 3rd ed. 
(Orlando, FL: Academic Press, 1985), Section 8.5. 
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Differentiating the series term by term, 

^ = ai + + 3fl 3 f 2 + ■•• = £ j a jC~', 

dt i H 

and 

~ = 2a 2 + 2 ■ 3a 3 f + 3 ■ 4+.••• = £](;' + 1)(/ + 2)a /+2 f-'. 

“? ;=o 

Putting these into Equation 2.62, we find 

DO 

Y, [O' + DO + 2)« j+ 2 - 2 jaj + (K - l)o ; ] c = 0. [2.64] 

7=0 

It follows (from the uniqueness of power series expansions 16 ) that the coefficient of 
each power of £ must vanish, 

0 + 1)0' + 2)a/+2 - 2 jaj + (K - 1 )aj = 0, 


and hence that 


(2 j + 1 -K) 

a '* 2 0+D0 + 2) aj 


[2.65] 


This recursion formula is entirely equivalent to the Schrodinger equation itself. 
Given 0 O it enables us (in principle) to generate 0 2 , 04 , a<>, ..., and given a i it generates 
u 3 , 05 , 07 , ... . Let us write 


A(^)=Aev en (l)+Aodd(f), 


[ 2 . 66 ] 


where 

^even(l) = «0 + «2§ 2 + + ' " 

is an even function of £ (since it involves only even powers), built on 00 , and 

^odd(t) = + a 3% 3 + a 5% 5 H 

is an odd function, built on 01 . Thus Equation 2.65 determines h(£ ) in terms of two 
arbitrary constants (00 and 01 )—which is just what we would expect, for a second- 
order differential equation. 

However, not all the solutions so obtained are normalizable. For at very large 
j, the recursion formula becomes (approximately) 

2 

a j+2 ~ -aj. 


16 See, for example, Arfken (footnote 15), Section 5.7. 
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with the (approximate) solution 

C 


for some constant C, and this yields (at large f, where the higher powers dominate) 

Ad) C V K Ce$\ 

Now, if h goes like exp(£ 2 ), then (remember xfrl —that’s what we’re trying to calcu¬ 
late) goes like exp(£ 2 /2) (Equation 2.61), which is precisely the asymptotic behavior 
we don’t want. 17 There is only one way to wiggle out of this: For normalizable solu¬ 
tions the power series must terminate. There must occur some “highest” j (call it n) 
such that the recursion formula spits out a n+ i = 0 (this will truncate either the series 
/i ev en or the series h 0 ^\ the other one must be zero from the start). For physically 
acceptable solutions, then, we must have 

K = 2n + \, 

for some positive integer n, which is to say (referring to Equation 2.57) that the energy 
must be of the form 


E n = (n+ i )ha), for n- 0, 1, 2,.... [2.67] 

Thus we recover, by a completely different method, the fundamental quantization 
condition we found algebraically in Equation 2.50. 

For the allowed values of K, the recursion formula reads 


_ -2(n ~ j) 

“ J+2 ~ U + 1)0 + 2 ) aj 


[ 2 . 68 ] 


If n = 0, there is only one term in the series (we must pick a\ = 0 to kill /i odd , and 
j = 0 in Equation 2.68 yields <22 = 0): 


Ao(£) = 


and hence 

iM!) = a 0 e ~^ 2/2 

(which reproduces Equation 2.48). For n = 1 we pick ao = 0, 18 and Equation 2.68 
with j = 1 yields a-\ — 0, so 

Aid) = oil. 


17 It’s no surprise that the ill-behaved solutions are still contained in Equation 2.65; this recursion 
relation is equivalent to the Schrodinger equation, so it’s got to include both the asymptotic forms we found 
in Equation 2.59. 

18 Note that there is a completely different set of coefficients aj for each value of n. 
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and hence 

(confirming Equation 2.51). For n = 2, j = 0 yields a 2 = — 2ao, and j = 2 gives 
CI 4 = 0, so 

A2«)=«o(l-2f 2 ) 

and 

iM?) =flo(l /2 , 

and so on. (Compare Problem 2.13, where the same result was obtained by algebraic 
means.) 

In general, h n (£) will be a polynomial of degree n in f, involving even powers 
only, if n is an even integer, and odd powers only, if n is an odd integer. Apart from 
the overall factor (a 0 or a\) they are the so-called Hermite polynomials, //„(£)• 19 
The first few of them are listed in Table 2.1. By tradition, the arbitrary multiplicative 
factor is chosen so that the coefficient of the highest power of % is 2". With this 
convention, the normalized 20 stationary states for the harmonic oscillator are 

^ w = (^) 1/4 vfe H " ( ^ !/2 [2 ' 69] 

They are identical (of course) to the ones we obtained algebraically in Equation 2.50. 
In Figure 2.5a I have plotted f n (x) for the first few n’s. 

The quantum oscillator is strikingly different from its classical counterpart— 
not only are the energies quantized, but the position distributions have some bizarre 
features. For instance, the probability of finding the particle outside the classically 
allowed range (that is, with x greater than the classical amplitude for the energy 
in question) is not zero (see Problem 2.15), and in all odd states the probability of 

Table 2.1: The first few Hermite polynomials, H n (x). 


Ho 

= 1, 


Hy 

= 2x. 


h 2 

= 4x 2 - 

-2, 

Hi 

— 8x 3 - 

- I2x, 

Ha 

= 16x 4 

— 4Sx 2 + 12, 

h 5 

= 32x 5 

- 160x 3 + 120*. 



19 The Hermite polynomials have been studied extensively in the mathematical literature, and there 
are many tools and tricks for working with them. A few of these are explored in Problem 2.18. 

20 I shall not work out the normalization constant here; if you are interested in knowing how it is 
done, see, for example, Leonard Schiff, Quantum Mechanics, 3rd ed. (New York: McGraw-Hill, 1968), 
Section 13. 
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finding the particle at the center of the potential well is zero. Only at relatively large 
n do we begin to see some resemblance to the classical case. In Figure 2.5b I have 
superimposed the classical position distribution on the quantum one (for n = 100 ); if 
you smoothed out the bumps in the latter, the two would fit pretty well (however, in 
the classical case we are talking about the distribution of positions over time for one 
oscillator, whereas in the quantum case we are talking about the distribution over an 
ensemble of identically-prepared systems ). 21 

Problem 2.15 In the ground state of the harmonic oscillator, what is the probability 
(correct to three significant digits) of finding the particle outside the classically allowed 
region? Hint: Look in a math table under “Normal Distribution” or “Error Function”. 


Problem 2.16 Use the recursion formula (Equation 2.68) to work out H^) and 

^Problem 2.17 A particle in the harmonic oscillator potential has the initial wave 
function 

^(x, 0) = A[f 0 (x) + xjriQc)] 

for some constant A. 

(a) Normalize 'F (x, 0). 

(b) FindvF(x,0and|'I'(x,0l 2 . 

(c) Find the expectation value of x as a function of time. Notice that it oscillates 
sinusoidally. What is the amplitude of the oscillation? What is its (angular) 
frequency? 

(d) Use your result in (c) to determine {p). Check that Ehrenfest’s theorem holds 
for this wave function. 

(e) Referring to Figure 2.5, sketch the graph of j'Fj at t = 0, 7 r/w, iTt/to, Sn/o), 
and 4 tt/ co. (Your graphs don’t have to be fancy—just a rough picture to show 
the oscillation.) 


** Problem 2.18 In this problem we explore some of the more useful theorems (stated 
without proof) involving Hermite polynomials. 

(a) The Rodrigues formula states that 

H„(S) = (-l)V 2 (T)V-f ! . [2.70] 

dH 

Use it to derive f /3 and H 4 . 


21 The analogy is perhaps more telling if you interpret the classical distribution as an ensemble of 
oscillators all with the same energy, but with random starting times. 
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(b) The following recursion relation gives you H n+ 1 in terms of the two preceding 
Hermite polynomials: 


Hn+m = 2£#„(£) ~ 2nH n ^). [2.71] 

Use it, together with your answer to (a), to obtain /A and H^. 

(c) If you differentiate an nth-order polynomial, you get a polynomial of order 
(n — 1). For the Hermite polynomials, in fact, 

dH n 

-^L=2nH n ^). [2.72] 

Check this, by differentiating IU and H$. 

(d) H n (£) is the nth z-derivative, at z = 0, of the generating function exp(—z 2 + 
2z£); or, to put it another way, it is the coefficient of z n fn \ in the Taylor series 
expansion for this function: 


e -.- ! +2zf = g f[2.73] 

11=0 n - 

Use this to rederive Hq, H\, and H 2 . 


2.4 THE FREE PARTICLE 


We turn next to what should have been the simplest case of all: the free particle 
[ V (x) = 0 everywhere]. As you’ll see in a moment, the free particle is in fact a 
surprisingly subtle and tricky example. The time-independent Schrodinger equation 
reads 

= [2-74] 


2 m dx 2 


d V 12 , 

dj = ~ k +• 


where k = ■ 


[2.75] 


So far, it’s the same as inside the infinite square well (Equation 2.17), where the 
potential is also zero; this time, however, I prefer to write the general solution in 
exponential form (instead of sines and cosines) for reasons that will appear in due 
course: 

[2.76] 


Unlike the infinite square well, there are no boundary conditions to restrict the possible 
values of k (and hence of E)\ the free particle can carry any (positive) energy. Tacking 
on the standard time dependence, exp(— iEt/fi), 


^(x, t) = Ae ik(x ~^ t} + Be~ ik{x+ ^ t] . 


[2.77] 
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Now, any function of x and t that depends on these variables in the special 
combination (x ± v t) (for some constant u) represents a wave of fixed profile, traveling 
in the =px-direction, at speed v. A fixed point on the waveform (for example, a 
maximum or a minimum) corresponds to a fixed value of the argument, and hence to 
x and t such that 


x ± vt = constant, or x = + constant, 


which is the formula for motion in the =px-direction with constant speed v. Since 
every point on the waveform is moving along with the same velocity, its shape doesn’t 
change as it propagates. Thus the first term in Equation 2.77 represents a wave 
traveling to the right, and the second term represents a wave (of the same energy) 
going to the left. By the way, since they only differ by the sign in front of k, we might 
as well write 

V k (x,t) = Ae i{kx ~^ { \ [2.78] 

and let k run negative to cover the case of waves traveling to the left: 


k = 



I k > 0 =>■ traveling to the right, 

k < 0 => traveling to the left. 


[2.79] 


The speed of these waves (the coefficient of t over the coefficient of x) is 


_m_ nr 

tVa „‘' ,m “ 2m “ V 2m 


[ 2 . 80 ] 


On the other hand, the classical speed of a free particle with energy E is given by 
E = (1/2 )mv 2 (pure kinetic, since V = 0), so 


^classical — 



quantum • 


[2.81] 


Evidently the quantum mechanical wave function travels at half the speed of the 
particle it is supposed to represent! We’ll return to this paradox in a moment—there 
is an even more serious problem we need to confront first: This wave function is not 
normalizable! For 


/ +oo r+oo 

%^ k dx = \A\ 2 j ldx = \A\ 2 (oo). [2.82] 

In the case of the free particle, then, the separable solutions do not represent physically 
realizable states. A free particle cannot exist in a stationary state; or, to put it another 
way, there is no such thing as a free particle with a definite energy. 

But that doesn’t mean the separable solutions are of no use to us. For they 
play a mathematical role that is entirely independent of their physical interpretation: 
The general solution to the time-dependent Schrodinger equation is still a linear 
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combination of separable solutions (only this time it’s an integral over the continuous 
variable k, instead of a sum over the discrete index n): 

1 r +0 ° 

*0e,<) = ^=/ m [2.83] 

V27T J-oo 

[The quantity I/a/Ztt is factored out for convenience; what plays the role of the 
coefficient c n in Equation 2.14 is the combination (1/V2 tt)0(£) dk.] Now this wave 
function can be normalized [for appropriate 4>(k)]. But it necessarily carries a range 
of k’ s, and hence a range of energies and speeds. We call it a wave packet. 

In the generic quantum problem, we are given ^(x, 0), and we are to find 
q>(x, t). For a free particle the solution has the form of Equation 2.83; the only 
remaining question is how to determine 0 (&) so as to fit the initial wave function: 

1 f +0 0 

*(x,0) = — <P(k)e ,tx dk. [2.84] 

V 2 71 J—o c 


This is a classic problem in Fourier analysis; the answer is provided by Plancherel’s 
theorem (see Problem 2.20): 



F(k) is called the Fourier transform of /(x); /(x) is the inverse Fourier transform 
of F(k) (the only difference is in the sign of the exponent). There is, of course, some 
restriction on the allowable functions: The integrals have to exist. 22 For our purposes 
this is guaranteed by the physical requirement that ^(x, 0) itself be normalized. So 
the solution to the generic quantum problem, for the free particle, is Equation 2.83, 
with ___ 

1 p+oo 

<t,(k) = ^=\ »(x,0 )e-‘ ix dx. [2.86] 

V 2 71 J— OO 

I’d love to work out an example for you—starting with a specific function ^(x, 0) 
for which we could actually calculate <f>(k), and then doing the integral in Equation 
2.83 to obtain 4>(x, t) in closed form. Unfortunately, manageable cases are hard to 

22 The necessary and sufficient condition on f(x) is that j ^ \f(x)\ 2 dx be finite. (In that case 
\ F(k)\ 2 dk is also finite, and in fact the two integrals are equal.) See Arfken (footnote 15), Sec¬ 
tion 15.5. 
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come by, and I want to save the best example for you to work out yourself. Be sure, 
therefore, to study Problem 2.22 with particular care. 

I return now to the paradox noted earlier—the fact that the separable solution 
{x , t) travels at the “wrong” speed for the particle it ostensibly represents. Strictly 
speaking, the problem evaporated when we discovered that ^ is not a physically 
achievable state. Nevertheless, it is of interest to discover how information about 
the particle velocity is carried by the wave function (Equation 2.83). The essential 
idea is this: A wave packet is a sinusoidal function whose amplitude is modulated 
by tj> (Figure 2.6); it consists of “ripples” contained within an “envelope.” What 
corresponds to the particle velocity is not the speed of the individual ripples (the so- 
called phase velocity), but rather the speed of the envelope (the group velocity)— 
which, depending on the nature of the waves, can be greater than, less than, or equal 
to the velocity of the ripples that go to make it up. For waves on a string, the group 
velocity is the same as the phase velocity. For water waves it is one half the phase 
velocity, as you may have noticed when you toss a rock into a pond: If you concentrate 
on a particular ripple, you will see it build up from the rear, move forward through 
the group, and fade away at the front, while the group as a whole propagates out at 
half the speed. What I need to show is that for the wave function of a free particle 
in quantum mechanics the group velocity is twice the phase velocity—just right to 
represent the classical particle speed. 

The problem, then, is to determine the group velocity of a wave packet with the 
general form 

4>(x,0 = / + 4>{k)e i(kx ~ wt) dk. 

\2tc J— oo 

[In our case to = (hk 1 j2m). but what I have to say now applies to any kind of wave 
packet, regardless of its dispersion relation —the formula for to as a function of k .] 
Let us assume that <f>{k) is narrowly peaked about some particular value ko. [There 
is nothing illegal about a broad spread in k, but such wave packets change shape 
rapidly (since different components travel at different speeds), so the whole notion 
of a “group,” with a well-defined velocity, loses its meaning.] Since the integrand 



Figure 2.6: A wave packet. The 
“envelope” travels at the group velocity; 
the “ripples” travel at the phase velocity. 
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is negligible except in the vicinity of A 0 , we may as well Taylor-expand the function 
co(k) about that point and keep only the leading terms: 

co(k) ~ 0)0 + (A - A 0 ), 

where co' 0 is the derivative of co with respect to A, at the point Ao. 

Changing variables from A to 5 = A - Ao, to center the integral at Ao, we have 

1 f +co 

q/(x, t) ^ ^ 

V 2 7T J —00 


At f = 0, 

1 / ,+co 

^(x, 0) = -== / 0(A O + £/j, 

V2jt J—oo 

and at later times 

1 /*+°o 

>P(x, 0 = -—e i( ~ olot+ko01 'o t) / 0(A O + ds. 

\fl7Z J —oo 


Except for the shift from x to (x - co f 0 t ), the integral is the same as the one in 'I' (x, 0). 
Thus 

^(x, t) = - co' 0 t, 0). [2.87] 

Apart from the phase factor in front (which won’t affect )^| 2 in any event), the wave 
packet evidently moves along at a speed 


da) 

%oup = lk 


[ 2 . 88 ] 


(evaluated at A = Ao), which is to be contrasted with the ordinary phase velocity 


CO 

t^phase = • 


[2.89] 


In our case, co = ( tik 2 jlm ), so oo/k = (hk/2m), whereas dco/dk = (hkjm), which 
is twice as great. This confirms that it is the group velocity of the wave packet, not 
the phase velocity of the stationary states, that matches the classical particle velocity: 

^classical = t^group := 2Uphase- [2.90] 


Problem 2.19 Show that the expressions [Ae ikx + Be ikx ], [C cos Ax + D sin Ax], 
[F cos(Ax+a)L and [G sin(Ax+/?)] are equivalent ways of writing the same function 
of x, and determine the constants C, D, F , G, a, and in terms of A and B. (In 
quantum mechanics, with V = 0, the exponentials give rise to traveling waves, 
and are most convenient in discussing the free particle, whereas sines and cosines 
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correspond to standing waves, which arise naturally in the case of the infinite square 
well.) Assume the function is real. 

**Problem 2.20 This problem is designed to guide you through a “proof” of Plan- 
cherel’s theorem, by starting with the theory of ordinary Fourier series on a finite 
interval, and allowing that interval to expand to infinity. 

(a) Dirichlet’s theorem says that “any” function fix) on the interval [—a, +«] can 
be expanded as a Fourier series: 

OQ 

f(x) = sin (nnx/a) + b n cos(«7rx/a)]. 

«=o 

Show that this can be written equivalently as 

/(*) = £ c n e innx/a . 

A2 = -OC 


What is c n , in terms of a n and b n ? 

(b) Show (by appropriate modification of Fourier’s trick) that 

c n = “ [ “ f{x)e~ in7ix/a dx. 

2 a J_ a 

(c) Eliminate n and c„ in favor of the new variables k = innja) and F(k) = 
«J2/tz ac n . Show that (a) and (b) now become 

f{x) = F(k)e ikx Ak- F(k) = —L= f + ° f(x)e~ ikx dx, 

y/2n n t^oo '/Fit J—a 

where is the increment in k from one n to the next. 

(d) Take the limit a -»■ oo to obtain Plancherel’s theorem. Note: In view of their 
quite different origins, it is surprising (and delightful) that the two formulas 
[one for F(k) in terms of f(x), the other for f(x) in terms of F’(Ar)] have such 
a similar structure in the limit a oo. 


Problem 2.21 Suppose a free particle, which is initially localized in the range 
—a < x < a, is released at time t = 0: 


T>(x, 0) 


A, if —a < x < a, 
0, otherwise, 


where A and a are positive real constants. 
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(a) Determine A, by normalizing 'L. 

(b) Determine <fc(k) (Equation 2.86). 

(c) Comment on the behavior of <p(k) for very small and very large values of a. 
How does this relate to the uncertainty principle? 


^Problem 2.22 A free particle has the initial wave function 
^(*,0) = Ae~ ax \ 
where A and a are constants {a is real and positive). 


(a) Normalize 0). 

(b) Find 'I'Or, 0* Hint: Integrals of the form 



e -(ax 2 +bx) 


dx 


can be handled by “completing the square.” Let y = ^/a[x + (b/2a)], and note 
that (ax 2 T bx) = y 2 - ( b 2 /4a ). Answer : 


V(x, t) 


2 a \ l/A e -^ 2 /H+(2 ihat/m)] 

j r / VI + (2 ihat/m) 


(c) Find |^(a:V)| 2 - Express your answer in terms of the quantity w = 
■yJa/[\ + (2 hat/m) 2 ]. Sketch I'Ll 2 (as a function of x) at t = 0, and again 
for some very large t. Qualitatively, what happens to I'L | 2 as time goes on? 

<d) Find (x), (p), (x 2 ), ( p 2 ), a x , and <j p . Partial answer: (p 2 ) = ah 2 , but it may 
take some algebra to reduce it to this simple form. 

(e) Does the uncertainty principle hold? At what time t does the system come 
closest to the uncertainty limit? 


2.5 THE DELTA-FUNCTION POTENTIAL 


We have encountered two very different kinds of solutions to the time-independent 
Schrodinger equation: For the infinite square well and the harmonic oscillator they 
are normalizable, and labeled by a discrete index n\ for the free paticle they are 
non-normalizable, and labeled by a continuous variable k. The former represent 
physically realizable states in their own right, the latter do not; but in both cases 
the general solution to the time-dependent Schrodinger equation is a linear combina¬ 
tion of stationary states—for the first type this combination takes the form of a sum 
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(over n), whereas for the second it is an integral (over k). What is the physical 
significance of this distinction? 

In classical mechanics a one-dimensional time-independent potential can give 
rise to two rather different kinds of motion. If V (x) rises higher than the particle’s to¬ 
tal energy (£) on either side (Figure 2.7a), then the particle is “stuck” in the potential 
well—it rocks back and forth between the turning points, but it cannot escape (unless, 



(a) 



(c) 


Figure 2.7: (a) A bound state, (b) Scattering states, (c) A classical bound state, 
but a quantum scattering state. 
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of course, you provide it with a source of extra energy, such as a motor, but we’re 
not talking about that). We call this a bound state. If, on the other hand, E exceeds 
V(x) on one side (or both), then the particle comes in from “infinity”, slows down or 
speeds up under the influence of the potential, and returns to infinity (Figure 2.7b). 
(It can’t get trapped in the potential unless there is some mechanism, such as friction, 
to dissipate energy, but again, we’re not talking about that.) We call this a scattering 
state. Some potentials admit only bound states (for instance, the harmonic oscillator); 
some allow only scattering states (a potential hill with no dips in it, for example); 
some permit both kinds, depending on the energy of the particle. 

As you have probably guessed, the two kinds of solutions to the Schrodinger 
equation correspond precisely to bound and scattering states. The distinction is even 
cleaner in the quantum domain, because the phenomenon of tunneling (which we’ll 
come to shortly) allows the particle to “leak” through any finite potential barrier, so 
the only thing that matters is the potential at infinity (Figure 2.7c): 

I E < F(-oo) and F(+oo) => bound state, T2 911 

E > V (-oo) or V (+oo) =>■ scattering state. 


In “real life” most potentials go to zero at infinity, in which case the criterion simplifies 
even further: 

I E < 0 =>• bound state, ^ 

E > 0 =>• scattering state. 

Because the infinite square well and harmonic oscillator potentials go to infinity as 
x -* ±oo, they admit bound states only; because the free particle potential is zero 
everywhere, it only allows scattering states. 23 In this section (and the following one) 
we shall explore potentials that give rise to both kinds of states. 

The Dirac delta function, 5(x), is defined informally as follows: 

«*> = («, [2 ' 93] 
It is an infinitely high, infinitesimally narrow spike at the origin, whose area is 1 
(Figure 2.8). Technically, it’s not a function at all, since it is not finite at x = 0 
(mathematicians call it a generalized function, or distribution). 24 Nevertheless, it is 
an extremely useful construct in theoretical physics. (For example, in electrodynam¬ 
ics the charge density of a point charge is a delta function.) Notice that 5(x — a) would 


23 If you are very observant, and awfully fastidious, you may have noticed that the general theorem 
requiring E > V mm (Problem 2.2) does not really apply to scattering states, since they are not normalizable 
anyway. If this bothers you, try solving the Schrodinger equation with E < 0, for the free particle, and 
note that even linear combinations of these solutions cannot be normalized. The positive energy solutions 
by themselves constitute a complete set. 

24 The delta function can be thought of as the limit of a sequence of functions, such as rectangles 
(or triangles) of ever-increasing height and ever-decreasing width. 
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Figure 2.8: The Dirac delta function 
(Equation 2.93). 


be a spike of area 1 at the point a. If you multiply S(x — a) by an ordinary function 
/(x), it’s the same as multiplying by / (a): 

f(x)8(x -a) = f(a)8(x - a ), [2.941 

because the product is zero anyway except at the point a. In particular, 

/ +00 f +00 

f(x)8(x—a)dx = f(a) J 8 (x — a) dx = f(a). [2.95] 

That’s the most important property of the delta function: Under the integral sign it 
serves to “pick out” the value of /(x) at the point a. (Of course, the integral need 
not go from — oo to +oo; all that matters is that the domain of integration include the 
point a, so a — € to a + c would do, for any € > 0.) 

Let’s consider a potential of the form 


V(x) = —aS(x), 


[2-96] 


where a is some constant. This is an artificial potential (so was the infinite square 
well), but it’s beautifully simple and in some respects closer to reality than any of the 
potentials we have considered so far. The Schrodinger equation reads 

h 2 d 2 ir 

[297] 

This potential yields both bound states (E < 0) and scattering states (E > 0); we’ll 
look first at the bound states. 

In the region x < 0, V (x) = 0, so 


d 2 \jr 
~dx^ 


2m E 


K 2 \jf, 


[2.98] 


where 

yj—2m E 


k = 


ti 


[2.99] 
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(E is negative, by assumption, so k is real and positive.) The general solution to 
Equation 2.98 is 

$(x) = Ae~ KX + Be KX , [2.100] 

but the first term blows up as x —> — oo, so we must choose A = 0: 

f(x) = Be KX , (x < 0). [2.101] 


In the region x > 0, V (x) is again zero, and the general solution is of the form 
F exp(— kx) + G exp(/cx); this time it’s the second term that blows up (as x -► +oo), 
so 


\Jr(x) = Fe KX , (x > 0). 


[ 2 . 102 ] 


It remains only to stitch these two functions together, using the appropriate 
boundary conditions at x = 0. I quoted earlier the standard boundary conditions 
for \(r: 


1. \jf is always continuous, and 

2. d'ljf/dx is continuous except at points 
where the potential is infinite. 


[2.103] 


In this case the first boundary condition tells us that F = B, so 


f(x) = 


Be KX , 

Be~ KX 


O < 0), 

(x > 0). 


[2.104] 


[i fr(x) is plotted in Figure 2.9.] The second boundary condition tells us nothing; this 
is (like the infinite square well) the exceptional case where V is infinite at the join, and 
it’s clear from the graph that this function has a kink at x = 0. Moreover, up to this 
point the delta function has not come into the story at all. Evidently the delta function 
must determine the discontinuity in the derivative of t^, at x =0. I’ll show you now 
how this works, and as a byproduct we’ll see why d^rjdx is ordinarily continuous. 

The idea is to integrate the Schrodinger equation, from —e to +e, and then take 
the limit as € —* 0: 



Figure 2.9: Bound state wave function for the delta function potential (Equation 
2.104). 
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*1 f +e 

2m J_ e 


d 2 fi 


-t 


dx + I V (x)fi(x)dx 



fi{x) dx. 


[2.105] 


The first integral is nothing but dfi/dx, evaluated at the two end points; the last 
integral is zero , in the limit e ^ 0, since it’s the area of a sliver with vanishing width 
and finite height. Thus 

a 2m Hm /• V{x)fi{x)dx. [2.106] 

\dx / ti 2 ^oJ_ € 

Ordinarily, the limit on the right is again zero, and hence dfi/dx is continuous. 
But when V (x) is infinite at the boundary, that argument fails. In particular, if 
V (x) = — a<5(x), Equation 2.95 yields 


A 



2 ma _ 
--j-HO). 


[2.107] 


For the case at hand (Equation 2.104), 


dfijdx = —Bice KX , for (x > 0), so dfi/dx\ + — —Bk, 
dfijdx = -\-Bice +KX , for (x < 0), so dfi/dx\_ — +Bk, 

and hence A (dfi/dx) = —2 Bk. And fi(0) = B. So Equation 2.107 says 

ma 

K — — 7 T, 

ft 2 

and the allowed energy (Equation 2.99) is 

^ h 2 tc 2 ma 2 
~ ~ ~^2fi 2 ' 


Finally, we normalize fi : 

f +oo 


/ +oo /•« 

\if(x)\ 2 dx = 2\B\ 2 / 

-oo Jo 


X , i^l 2 , 

dx =- — 1, 


[2.108] 


[2.109] 


so (choosing, for convenience, the positive real root): 


B = *Jlc = 


/ma 
h 


[ 2 . 110 ] 


Evidently the delta-function well, regardless of its “strength” a, has exactly one bound 
state: 



‘■\*\/& 2 . £ 


ma 2 


[ 2 . 111 ] 




Chap. 2 The Time-Independent Schrodinger Equation 


What about scattering states, with E > 0? For x < 0 the Schrodinger equation 


reads 


d 2 f 

dx 2 


2m E 

-ir* 


= ~k 2 f, 


where 



[ 2 . 112 ] 


is real and positive. The general solution is 

f(x) = Ae‘ kx + Be ~ ikx , [2.113] 

and this time we cannot rule out either term, since neither of them blows up. Similarly, 
for* > 0, 

rfr(x) = Fe ikx + Ge~ ikx . [2.114] 

The continuity of ft (x) at x — 0 requires that 

F + G = A + B. [2.115] 


The derivatives are 

| dftjdx = ik(Fe‘ kx - Ge~ ikx ), for (x > 0), sodft/dx\ + = ik(F - G), 

I dft/dx = ik(Ae ikx — Be ~ lkx ), for (x < 0), sodty/dx\_ = ik{A — B), 

and hence A (d\J//dx) = ik(F — G — A + B). Meanwhile, = (A + B), so the 
second boundary condition (Equation 2.107) says 

ik(F - G - A + B) = -^-(A + B), [2.116] 

ft 1 


or, more compactly, 

F-G = A{ 1 + lift) - B{\ - 2/£), where = ^. [2.117] 

n z k 

Having imposed the boundary conditions, we are left with two equations (Equa¬ 
tions 2.115 and 2.117) in four unknowns (A, B , F, and G) — five, if you count k. Nor¬ 
malization won’t help—this isn’t a normalizable state. Perhaps we’d better pause, 
then, and examine the physical significance of these various constants. Recall that 
exp(i'£;t) gives rise [when coupled with the time-dependent factor exp(— iEt/fi)\ to 
a wave function propagating to the right, and exp (—ikx) leads to a wave propagating 
to the left. It follows that A (in Equation 2.113) is the amplitude of a wave coming in 
from the left, B is the amplitude of a wave returning to the left, F (in Equation 2.114) 
is the amplitude of a wave traveling off to the right, and G is the amplitude of a wave 
coming in from the right (Figure 2.10). In a typical scattering experiment particles 
are fired in from one direction—let’s say, from the left. In that case the amplitude of 
the wave coming in from the right will be zero: 
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Figure 2.10: Scattering from a 
delta-function well. 


G = 0 (for scattering from the left). [2.118] 

A is then the amplitude of the incident wave, B is the amplitude of the reflected 
wave, and F is the amplitude of the transmitted wave. Solving Equations 2.115 and 
2.117 for B and F, we find 

B = ——— A, F = ~^—A. [2.119] 

l-if l-if 

(If you want to study scattering from the right, set A = 0; then G is the incident 
amplitude, F is the reflected amplitude, and B is the transmitted amplitude.) 

Now, the probability of finding the particle at a specified location is given by 
|^ | 2 , so the relative 25 probability that an incident particle will be reflected back is 


/? _|g| 2 = P 2 

\A\ 2 1+/J2- 


[ 2 . 120 ] 


R is called the reflection coefficient. (If you have a beam of particles, it tells you the 
fraction of the incoming number that will bounce back.) Meanwhile, the probability 
of transmission is given by the transmission coefficient 


t = !£!! = _L_ 

\A\ 2 l+f 2 ' 


[ 2 . 121 ] 


Of course, the sum of these probabilities should be 1—and it is: 


R + T — 


[ 2 . 122 ] 


Notice that R and T are functions of f, and hence (Equations 2.112 and 2.117) of E: 

[2.123] 


R -- 


1 


1 


l + (2 h 2 E/ma 2 ) \ + {ma 2 /2h 2 E) 


25 This is not a normalizable wave function, so the absolute probability of finding the particle at a 
particular location is not well defined; nevertheless, the ratio of probabilities for two different locations is 
meaningful. More on this in the next paragraph. 
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The higher the energy, the greater the probability of transmission (which seems rea¬ 
sonable). 

This is all very tidy, but there is a sticky matter of principle that we cannot al¬ 
together ignore: These scattering wave functions are not normalizable, so they don’t 
actually represent possible particle states. But we know what the resolution to this 
problem is: We must form normalizable linear combinations of the stationary states, 
just as we did for the free particle—true physical particles are represented by the 
resulting wave packets. Though straightforward in principle, this is a messy busi¬ 
ness in practice, and at this point it is best to turn the problem over to a computer. 26 
Meanwhile, since it is impossible to create a normalizable free particle wave function 
without involving a range of energies, R and T should be interpreted as the approxi¬ 
mate reflection and transmission probabilities for particles in a narrow energy range 
about E. Incidentally, it might strike you as peculiar that we were able to analyse a 
quintessential^ time-dependent problem (particle comes in, scatters off a potential, 
and flies off to infinity) using stationary states. After all, if/ (in Equations 2.113 and 
2.114) is simply a complex, time-independent, sinusoidal function, extending (with 
constant amplitude) to infinity in both directions. And yet, by imposing appropriate 
boundary conditions on this function, we were able to determine the probability that a 
particle (represented by a localized wave packet) would bounce off, or pass through, 
the potential. The mathematical miracle behind this is, I suppose, the fact that by 
taking linear combinations of states spread over all space, and with essentially triv¬ 
ial time dependence, we can construct wave functions that are concentrated about a 
(moving) point, with quite elaborate behavior in time (see Problem 2.40). 

As long as we’ve got the relevant equations on the table, let’s look briefly at 
the case of a delta-function barrier (Figure 2.11). Formally, all we have to do is 
change the sign of a. This kills the bound state, of course (see Problem 2.2). On 
the other hand, the reflection and transmission coefficients, which depend only on 
a 2 , are unchanged. Strange to say, the particle is just as likely to pass through the 
barrier as to cross over the well! Classically, of course, the particle could not make 
it over an infinitely high barrier, regardless of its energy. In fact, the classical scattering 



Figure 2.11: The delta-function barrier. 


26 There exist some powerful programs for analysing the scattering of a wave packet from a one¬ 
dimensional potential; see, for instance, A. Goldberg, H. M. Schey, and J. L. Schwartz, Am. J. Phys. 35, 
177 (1967). 
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problem is pretty dull: If E > V maK , then T = 1 and R — 0—the particle certainly 
makes it over; conversely, if E < V max , then T = 0 and R = 1—it rides “up the 
hill” until it runs out of energy, and then returns the same way it came. The quantum 
scattering problem is much richer; the particle has some nonzero probability of passing 
through the potential even if E < V max . We call this phenomenon tunneling; it is 
the mechanism that makes possible much of modem electronics—not to mention 
spectacular recent advances in microscopy. Conversely, even if E > F ma x, there is a 
possibility that the particle will bounce back—though I wouldn’t advise driving off a 
cliff in the expectation that quantum mechanics will save you (see Problem 2.41). 

Problem 2,23 Evaluate the following integrals: 

(a) (x 3 — 3x 2 + lx — 1)5 (x + 2) dx 

(b) / 0 °°[cos(3x) + 2]5(x - 7r) dx 

(c) 1 exp(|x | + 3)5 (x - 2) dx. 

Problem 2.24 Two expressions [D t (x) and D 2 (x)] involving delta functions are 
said to be equal if 


/ +<x> r+oo 

f(x)Di(x)dx = J f(x)D 2 (x) dx, 


for any (ordinary) function fix). 


(a) Show that 

5(cx) = J-5(x), 

M 

where c is a real constant. 

(b) Let 0(x) be the step function: 


*(*) = 


1 , 

0, 


if x >0, 
if x < 0. 


(2.124] 


[2.125] 


[In the rare case where it actually matters, we define 9(0) to be 1/2.] Show that 
dO/dx — S(x). 


❖Problem 2.25 What is the Fourier transform of 5 (x)? Using Plancherel’s theorem, 
show that 


5(x) = 


-f 

2 7T 


X dk. 


[2.126] 


Comment : This formula gives any respectable mathematician apoplexy. Although 
the integral is clearly infinite when x = 0, it doesn’t converge (to zero or anything 
else) when x ^ 0, since the integrand oscillates forever. There are ways to patch it up 
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(for instance, you can integrate from — L to +L, and interpret the integral in Equation 
2.126 to mean the average value of the finite integral, as L oo). The source of the 
problem is that the delta function doesn’t meet the requirement (square integrability) 
for Plancherel’s theorem (see footnote 22). In spite of this. Equation 2.126 can be 
extremely useful, if handled with care. 

♦Problem 2.26 Consider the double delta-function potential 

V (x) — —a[5(x + a) + <5(x — a)], 

where a and a are positive constants. 

(a) Sketch this potential. 

(b) How many bound states does it possess? Find the allowed energies, for a = 
Ji 2 /ma and for a, = h 2 /4ma, and sketch the wave functions. 

♦♦Problem 2.27 Find the transmission coefficient for the potential in Problem 2.26. 


2.6 THE FINITE SQUARE WELL 


As a last example, consider the finite square well 


V(x) = 


-E 0 , 

0 , 


for —a < x < a, 
for \x\ > a, 


[2.127] 


where Vq is a (positive) constant (Figure 2.12). Like the delta-function well, the finite 
square well admits both bound states (with E < 0) and scattering states (with E > 0). 
We’ll look first at the bound states. 

In the region x < —a the potential is zero, so the Schrodinger equation reads 


ft 2 d 2 ir 


d H 2 , 


V(x) 


a 


x 


Figure 2.12: The finite square well 
(Equation 2.127). 
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where 




V -2m E 
h 


[2.128] 


is real and positive. The general solution is \jr{x) = A exp(— kx) + B exp(fcx), but 
the first term blows up (as x —> —oo), so the physically admissable solution (as 
before—see Equation 2.101) is 


\!r(x) = Be KX , for (x < —a). [2.129] 


In the region —a < x < a, V(x) = —V o, and the Schrodinger equation reads 


ft 2 d 2 \fs 
2m dx 2 


- V 0 f = Ef, or —r 


d 2 f 
lx 2 : 


-l 2 f. 


where ____ 

;b V2 m(E+V 0 ) [2.130] 

h 

Although E is negative, for a bound state, it must be greater than -F 0 , by the old 
theorem E > f min (Problem 2.2); so l is also real and positive. The general solution 
is 

tjr(x) = C sin(/x) + Dcos(lx), for (—a < x < a), [2.131] 

where C and D are arbitrary constants. Finally, in the region x > a the potential is 
again zero; the general solution is ir(x) = F exp(-icx) + G exp(xx), but the second 
term blows up (as x oo), so we are left with 

^r(x) = Fe ~ KX , for (x > a). [2.132] 

The next step is to impose boundary conditions: \fr and d\j/jdx continuous at 
—a and +a. But we can save a little time by noting that this potential is an even 
function, so we can assume with no loss of generality that the solutions are either 
even or odd (Problem 2.1c). The advantage of this is that we need only impose the 
boundary conditions on one side (say, at +a); the other side is then automatic, since 
\jr(—x) — ±\fr(x). I’ll work out the even solutions; you get to do the odd ones in 
Problem 2.28. The cosine is even (and the sine is odd), so I’m looking for solutions 
of the form 

( Fe~ KX , for (x > a), 

Dcos(/x), for (0 < x < a), [2.133] 

f(-x), for (x < 0). 

The continuity of ir{x), atx — a, says 

Fe~ Ka = Dcos(la), [2.134] 

and the continuity of dty/dx says 


— KFe Ka = —ID sin(la). 


[2.135] 
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Dividing Equation 2.135 by Equation 2.134, we find that 

k = /tan(/a). [2.136] 


Equation 2.136 is a formula for the allowed energies, since k and / are both 
functions of E. To solve for E, it pays to adopt some nicer notation. Let 

z = la, and z 0 = %^2mV 0 . [2.137] 

Ti 


According to Equations 2.128 and 2.130, (k 2 + l 2 ) = ImVo/fi 2 , so Ka = yfzf—z?, 
and Equation 2.136 reads 

tanz = yf(zo/z ) 2 — L [2.138] 

This is a transcendental equation for z (and hence for E ) as a function of zo (which is 
a measure of the “size” of the well). It can be solved numerically, using a calculator 
or a computer, or graphically, by plotting tan z and y/(zo/z ) 2 — 1 on the same grid, 
and looking for points of intersection (see Figure 2.13). Two limiting cases are of 
special interest: 

1. Wide, deep well. If z.o is very large, the intersections occur just slightly 
below z n = rm/2, with n odd; it follows that 


E n T Vo = 


nVh 2 

2m (2a) 2 


[2.139] 


Here (E + Vo) is the energy above the bottom of the well, and on the right we have 
precisely the infinite square well energies, for a well of width 2 a (see Equation 2.23)— 
or rather, half of them, since n is odd. (The other ones, of course, come from the odd 
wave functions, as you’ll find in Problem 2.28.) So the finite square well goes over to 
the infinite square well, as Vo —> oo; however, for any finite Vo there are only finitely 
many bound states. 

2. Shallow, narrow well. As z 0 decreases, there are fewer and fewer bound 
states, until finally (for zo < n/2, where the lowest odd state disappears) only one 
remains. It is interesting to note, however, that there is always one bound state, no 
matter how “weak” the well becomes. 

You’re welcome to normalize (Equation 2.133), if you’re interested (see 
Problem 2.29), but I’m going to move on now to the scattering states (E > 0). To the 
left, where V (jc) = 0, we have 


fi(x) = Ae ikx + Be~ ikx , for (x < -a), [2.140] 


where (as usual) 



Inside the well, where V (*) = — Vo, 


[2.141] 


fi(x) = C sin(Lc) + Dcos(/jc), for (—a < x < a), 


[2.142] 
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Figure 2.13: Graphical solution to Equation 2.138, for zo = 8 (even states). 


where, as before, 

i _ V2 m(E + Vq) 
— h 


[2.143] 


To the right, assuming there is no incoming wave in this region, we have 

f(x) = Fe ikx . [2.144] 

A is the incident amplitude, B is the reflected amplitude, and F is the transmitted 
amplitude. 27 

There are four boundary conditions: Continuity of f(x) at —a says 

Ae~ lka + Be lka = —C sin(/a) + D cos (la ), [2.145] 

continuity of df/dx at —a gives 

ik[Ae~ ika - Be ika ] = /[Ccos(/fl) + D sin(/a)], [2.146] 

continuity of f(x) at +a yields 

C sin (la) + D cos (la) = Fe ‘ ka , [2.147] 

and continuity of df/dx at +a requires 

l[C cos(/a) — Dsin(/a)] = ikFe lka . [2.148] 

We can use two of these to eliminate C and D, and solve the remaining two for B 
and F (see Problem 2.31): 


sin(2/a) 7 

B = i ^kT (l 


- k z )F, 


[2.149] 


27 We could use even and odd functions, as we did for bound states, but these would represent 
standing waves, and the scattering problem is more naturally formulated in terms of traveling waves. 
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Figure 2.14: Transmission coefficient as a function of energy (Equation 2.151). 


e ~ 2 ika a 

F ~ cos(2 la)-i^^(k 2 +l 2 )' [2 ‘ i5 ° ] 

The transmission coefficient (T = \F\ 2 /\A\ 2 ), expressed in terms of the original 
variables, is given by 

= 1 + 4£(&W Si " 2 12.151] 

Notice that T = 1 (the well becomes “transparent”) whenever the argument of the 
sine is zero, which is to say, for 

2 a j - 

— y/2m(E n + Vq) = rm, [2.152] 


where n is any integer. The energies for perfect transmission, then, are given by 


E n + Vo = 


nVfi 2 

2m (2a) 2 ’ 


[2.153] 


which happen to be precisely the allowed energies for the infinite square well. T is 
plotted in Figure 2.14 as a function of energy. 

❖Problem 2.28 Analyze the odd bound-state wave functions for the finite square 
well. Derive the transcendental equation for the allowed energies, and solve it graph¬ 
ically. Examine the two limiting cases. Is there always at least one odd bound state? 

Problem 2.29 Normalize i p(x) in Equation 2.133 to determine the constants D 
and F. 


Problem 2.30 The Dirac delta function can be thought of as the limiting case of 
a rectangle of area 1, as the height goes to infinity and the width goes to zero. Show 
that the delta-function well (Equation 2.96) is a “weak” potential (even though it is 
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infinitely deep), in the sense that z 0 0. Determine the bound-state energy for 
the delta-function potential, by treating it as the limit of a finite square well. Check 
that your answer is consistent with Equation 2.111. Also show that Equation 2.151 
reduces to Equation 2.123 in the appropriate limit. 

* Problem 2.31 Derive Equations 2.149 and 2.150. Hint: Use Equations 2.147 and 
2.148 to solve for C and D in terms of F: 

C = j^sin(/a) + iy cos(/a)j e ika F\ D — ^cos(/a) - ij sin(/a)j e lka F. 

Plug these back into Equations 2.145 and 2.146. Obtain the transmission coeffi¬ 
cient, and confirm Eqation 2.151. Work out the reflection coefficient, and check that 
T + R= 1. 

** Problem 2.32 Determine the transmission coefficient for a rectangular barrier 
(same as Equation 2.127, only with +F 0 in the region -a < x < a). Treat separately 
the three cases E < V 0 , E = Vo, and E > V 0 (note that the wave function inside the 
barrier is different in the three cases). Partial answer: For E < Co, 28 

T ~' = l + 4 E(vf- i ) Sinh2 (|VWo-£)) ■ 


**Problem 2.33 Consider the step function potential: 


V(x) = 


0 , 

Vo, 


ifx <0, 
if x > 0. 


(a) Calculate the reflection coefficient, for the case E < V 0 , and comment on the 
answer. 

(b) Calculate the reflection coefficient for the case E > V a . 

(c) For a potential such as this that does not go back to zero to the right of the 
barrier, the transmission coefficient is not simply \F\ 2 /\A | 2 , with A the incident 
amplitude and F the transmitted amplitude, because the transmitted wave travels 
at a different speed. Show that 


E- Vo \F\ 2 
E Ml 2 ’ 


[2.154] 


for E > V 0 . Hint: You can figure it out using Equation 2.81, or—more elegantly, 
but less informatively—from the probability current (Problem 1.9a). What is T 
for E < Vo? 


28 This is a good example of tunneling —classically the particle would bounce back. 
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(d) For E > Vo, calculate the transmission coefficient for the step potential, and 
check that T + R = 1. 


2.7 THE SCATTERING MATRIX _ 

The theory of scattering generalizes in a pretty obvious way to arbitrary localized 
potentials (Figure 2.15). To the left (Region I), V (x) = 0, so 

f(x) = Ae ikx + Be~ ilx , where k = 22EA, [2.155] 

7i 

To the right (Region III), V (x) is again zero, so 

VKx) = Fe ikx + Ge~ ikx . [2.156] 

In between (Region II), of course, I can’t tell you what \fr is until you specify the 
potential, but because the Schrodinger equation is a linear, second-order differential 
equation, the general solution has got to be of the form 

Vr(x) = C/(x)+Z>g(x), [2.157] 

where /(x) and g(x) are any two linearly independent particular solutions. 29 There 
will be four boundary conditions (two joining Regions I and II, and two joining 
Regions II and III). Two of these can be used to eliminate C and D, and the other two 
can be “solved” for B and F in terms of A and G : 

B = S[\A + S\ 2 G, F = S 2 [ A + S 22 G. [2.158] 

The four coefficients Sjj, which depend on k (and hence on E), constitute a 
2x2 matrix 



Figure 2.15: Scattering from an arbitrary localized potential {V(x) = 0 except in 
Region II]. 


29 See any book on differential equations—for example, J. L. Van Iwaarden, Ordinary Differential 
Equations with Numerical Techniques (San Diego, CA: Harcourt Brace Jovanovich, 1985). Chapter 3. 
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S = 


'S n 

V S 21 


Sn\ 

S 22 ) 


[2.159] 


called the scattering matrix (or S-matrix, for short). The S-matrix tells you the 
outgoing amplitudes (B and F) in terms of the incoming amplitudes (A and G): 


(f)= s (g)- [2160] 

In the typical case of scattering from the left, G = 0, so the reflection and transmission 
coefficients are 


fi( = {^| =|Sn| 2 , 

\A l 2 Ig=o 

For scattering from the right, 4=0, and 

fir = Si =l%2| 2 . 

|G| 2 \a=o 


r, = lg = l^ 2 . 

\A I 2 G= 0 


r 15,2 K l 2 

T r = 77^ „ = Pl2l 

|G| 2 A= 0 


[2.161] 


[2.162] 


The S-matrix tells you everything there is to know about scattering from a local¬ 
ized potential. Surprisingly, it also contains (albeit in a concealed form) information 
about the bound states (if there are any). For if E < 0, then f(x) has the form 


( Be KX (Region I), 

Cf(x) + Dg(x ) (Region II), [2.163] 

Fe~ KX (Region III), 


with 


K 


V—2 mE 
h 


[2.164] 


The boundary conditions are the same as before, so the S-matrix has the same 
structure—only now E is negative, so k —» itc. But this time A and G are nec¬ 
essarily zero, whereas B and F are not, and hence (Equation 2.158) at least two 
elements in the S-matrix must be infinite. To put it the other way around, if you’ve 
got the S-matrix (for E > 0), and you want to locate the bound states, put in k ix, 
and look for energies at which the S-matrix blows up. 

For example, in the case of the finite square well, 


S 21 


ms(2la)-i^^(k 1 + P) 


(Equation 2.150). Substituting k -> ix, we see that S 2] blows up whenever 


cot(2/a) = 


1 2 -k 2 
2 xl 
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Using the trigonometric identity 

tan = ±\/1 + cot 2 9 - cot 9, 

we obtain 

/ K 

tan {la) = — (plus sign), and cot (la) = -- (minus sign). 

These are precisely the conditions for bound states of the finite square well (Equation 
2.136 and Problem 2.28). 


❖Problem 2.34 Construct the ^-matrix for scattering from a delta-function well 
(Equation 2.96). Use it to obtain the bound state energy, and check your answer 
against Equation 2.111. 

Problem 2.35 Find the S-matrix for the finite square well (Equation 2.127). Hint: 
This requires no new work if you carefully exploit the symmetry of the problem. 


FURTHER PROBLEMS FOR CHAPTER 2 _ 

Problem 2.36 A particle in the infinite square well (Equation 2.15) has the initial 
wave function 

\p(x, 0) = A sin 3 (^x/a). 

Find (x) as a function of time. 


❖Problem 2.37 Find (x), (p), (X 2 ), (p 2 ), ( T ), and (F(x)) for the nth stationary 
state of the harmonic oscillator. Check that the uncertainty principle is satisfied. 
Hint: Express x and (ft/i)(djdx) in terms of {a + ± aJ), and use Equations 2.52 and 
2.53; you may assume that the states are orthogonal. 

Problem 2.38 Find the allowed energies of the half-harmonic oscillator 

_ I (1/2 )ma) 2 x 2 , for (x > 0), 

[ oo, for (x < 0). 

(This represents, for example, a spring that can be stretched, but not compressed.) 
Hint: This requires some careful thought, but very little actual computation. 

❖❖Problem 2.39 Solve the time-independent Schrodinger equation for an infinite 
square well with a delta-function barrier at the center; 


V(x) = 


ocS(x), for (—a < x < -j -a), 
oo, for (|x| > a). 
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Treat the even and odd wave functions separately. Don’t bother to normalize them. 
Find the allowed energies (graphically, if necessary). How do they compare with the 
corresponding energies in the absence of the delta function? Comment on the limiting 
cases o' —> 0 and a oo. 

**Problem 2.40 In Problem 2.22 you analyzed the stationary Gaussian free particle 
wave packet. Now solve the same problem for the traveling Gaussian wave packet, 
starting with the initial wave function 

*(jc,0 ) = Ae~ axl e ilx , 


where / is a real constant. 

Problem 2.41 A particle of mass m and kinetic energy E > 0 approaches an 
abrupt potential drop Vo (Figure 2.16). 

(a) What is the probability that it will “reflect” back, if E = Vo/3? 

(b) I drew the figure so as to make you think of a car approaching a cliff, but 
obviously the probability of “bouncing back” from the edge of a cliff is far 
smaller than what you got in (a)—unless you’re Bugs Bunny. Explain why this 
potential does not correctly represent a cliff. 

Problem 2.42 If two (or more) distinct 30 solutions to the (time-independent) Schro- 
dinger equation have the same energy E, these states are said to be degenerate. For 
example, the free particle states are doubly degenerate—one solution representing 
motion to the right, and the other motion to the left. But we have encountered no 



Figure 2.16: Scattering from a “cliff” (Problem 2.41). 


30 If the two solutions differ only by a multiplicative constant (so that, once normalized, they differ 
only by a phase factor e 1 ^), they represent the same physical state, and in this case they are not distinct 
solutions. Technically, by “distinct” I mean “linearly independent.” 
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normalizable degenerate solutions, and this is not an accident. Prove the following 
theorem: In one dimension 31 there are no degenerate bound states. Hint : Suppose 
there are two solutions, fa and fa, with the same energy E. Multiply the Schrodinger 
equation for fa by fa, and the Schrodinger equation for fa by xj/[, and subtract, to 
show that (fad^i/dx — fadfa/dx) is a constant. Use the fact that for normalizable 
solutions \fr -> 0 at ±00 to demonstrate that this constant is in fact zero. Conclude 
that fa is a multiple of fa , and hence that the two solutions are not distinct. 

Problem 2.43 Imagine a bead of mass m that slides frictionlessly around a circular 
wire ring of circumference a, [This is just like a free particle, except that \f/(x) — 
\jr{x+ a).] Find the stationary states (with appropriate normalization) and the cor¬ 
responding allowed energies. Note that there are two independent solutions for each 
energy E n —corresponding to clockwise and counterclockwise circulation; call them 
fn (*) f~{x). How do you account for this degeneracy, in view of the theorem 

in Problem 2.42—that is, why does the theorem fail in this case? 

** Problem 2.44 (, Attention : This is a strictly qualitative problem—no calculations 
allowed!) Consider the “double square well” potential (Figure 2.17). Suppose the 
depth Vo and the width a are fixed, and great enough so that several bound states 
occur. 

(a) Sketch the ground-state wave function fa and the first excited state fa, (i) for 
the case b = 0, (ii) for b « a, and (iii) for b » a, 

(b) Qualitatively, how do the corresponding energies (£j and E 2 ) vary, as b goes 
from 0 to 00 ? Sketch E\ (b) and E 2 {b) on the same graph. 

(c) The double well is a very primitive one-dimensional model for the potential 
experienced by an electron in a diatomic molecule (the two wells represent the 
attractive force of the nuclei). If the nuclei are free to move, they will adopt the 
configuration of minimum energy. In view of your conclusions in (b), does the 
electron tend to draw the nuclei together, or push them apart? (Of course, there 
is also the intemuclear repulsion to consider, but that’s a separate problem.) 


***Problem 2.45 

(a) Show that 

x / mco\ l/4 T mo) / 0 a 2 0 . . ifit . A1 

^ 0 = Ur) exp [-» ( x + 2 ' 11+8 > + ^ ~ 2flxe_ ' )J 


31 In higher dimensions such degeneracy is very common, as we shall see in Chapter 4 . Assume that 
the potential does not consist of isolated pieces separated by regions where V = 00 —two isolated infinite 
square wells, for instance, would give rise to degenerate bound states, for which the particle is either in the 
one or in the other. 
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Figure 2.17: The double square well (Problem 2.44). 


satisfies the time-dependent Schrodinger equation for the harmonic oscillator 
potential (Equation 2.38). Here a is any real constant (with the dimensions of 
length). 32 

(b) Find ^(x, t)\ 2 , and describe the motion of the wave packet. 

(c) Compute (x) and (p), and check that Ehrenfest’s theorem (Equation 1.38) is 
satisfied. 

Problem 2.46 Consider the potential 

V(x) = ( itx < °’ 

w a), ifx>0, 

where a and a are positive real constants with the appropriate units (see Figure 2.18). 

A particle starts out in the “well” (0 < x < a), but because of tunneling its wave 

function gradually “leaks” out through the delta-function barrier. 

(a) Solve the (time-independent) Schrodinger equation for this potential; impose 
appropriate boundary conditions, and determine the “energy”, E. (An implicit 
equation will do.) 

(b) I put the word “energy” in quotes because you’ll notice that it is a complex 
numberl How do you account for this, in view of the theorem you proved in 
Problem 2.1a? 

(c) Writing E = E 0 + zT (with Eq and V real), calculate (in terms of T) the 
characteristic time it takes the particle to leak out of the well (that is, the time it 
takes before the probability is 1 je that it’s still in the region 0 < x < a). 


32 This rare example of an exact closed-form solution to the time-dependent Schrodinger equation 
was discovered by Schrodinger himself, in 1926. 
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Figure 2.18: The potential for Problem 2.46. 


**Problem 2.47 Consider the moving delta-function well: 

V (x, t) — —aS(x — vt), 
where v is the (constant) velocity of the well. 

(a) Show that the time-dependent Schrodinger equation admits the exact solution 

= ^^L e -mot\x-vt\/fi 2 e -i[(E+(\/2)mv 2 )t-mvx]/ti , 

h 

where E = ~ma 2 /7h 2 is the bound-state energy of the stationary delta function. 
Hint : Plug it in and check it! Use Problem 2.24b. 

(b) Find the expectation value of the Hamiltonian in this state, and comment on the 
result. 


***Problem 2.48 Consider the potential 

h 2 a 2 2 

V (x) = -sech (ax), 

m 

where a is a positive constant and “sech” stands for the hyperbolic secant. 


(a) Show that this potential has the bound state 

i ko(x) — A sech(ax), 

and find its energy. Normalize , and sketch its graph. 

(b) Show that the function 


f k (x) - A 


( ik — a tanh(<zx)\ 
ik + a ) 


} ikx 
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(where k = -Jim E/ft, as usual) solves the Schrodinger equation for any (posi¬ 
tive) energy E. Since tanhz —— 1 as z —>• — oo, 

f k (x)^ Ae lkx , for large negative x. 

This represents, then, a wave coming in from the left with no accompanying 
reflected wave [i.e., no term exp(—ifac)]. What is the asymptotic form of flk(F) 
at large positive x ? What are R and T for this potential? Note : sech 2 is a famous 
example of a “reflectionless” potential—every incident particle, regardless of 
its energy, passes right through. See R. E. Crandall and B. R. Litt, Annals of 
Physics 146,458 (1983). 

(c) Construct the S-matrix for this potential, and use it to locate the bound states. 
How many of them are there? What are their energies? Check that your answer 
is consistent with part (a). 

**= 1 = Problem 2.49 The S-matrix tells you the outgoing amplitudes (B and F) in terms 
of the incoming amplitudes (A and G): 

(* F )=( s f 

\F J \S2i S 22 J \ G J 

For some purposes it is more convenient to work with the transfer matrix, M, which 
gives you the amplitudes to the right of the potential (F and G) in terms of those to 
the left (A and B): 

(F\_(M n M i2 \(A\ 

\m 2 , m 22 )\b)- 

(a) Find the four elements of the M-matrix in terms of the elements of the 5-matrix, 
and vice versa. Express R}, Tj, R r , and T r (Equations 2.161 and 2.162) in terms 
of elements of the M-matrix. 

(b) Suppose you have a potential consisting of two isolated pieces (Figure 2.19). 
Show that the M-matrix for the combination is the product of the two M-matrices 
for each section separately: 

M =3 M 2 M]. 

(This obviously generalizes to any number of pieces, and accounts for the use¬ 
fulness of the M-matrix.) 


V=Q 


l /=0 


V=0 


Figure 2.19: A potential consisting of two isolated pieces (Problem 2.49). 
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(c) 


Construct the M-matrix for scattering from a single delta-function potential at 
point a : 

V (x) = — a^x — a). 


(d) By the method of part (b), find the M-matrix for scattering from the double delta 
function 

V (x) — —of[5(x + a) + <5(x — «)]. 

What is the transmission coefficient for this potential? 



CHAPTER 3 


FORMALISM 


3.1 LINEAR ALGEBRA 


The purpose of this chapter is to develop the formalism of quantum mechanics— 
terminology, notation, and mathematical background that illuminate the structure of 
the theory, facilitate practical calculations, and motivate a fundamental extension of 
the statistical interpretation. I begin with a brief survey of linear algebra. 1 Linear 
algebra abstracts and generalizes the arithmetic of ordinary vectors, as we encounter 
them in first-year physics. The generalization is in two directions: (1) We allow 
the scalars to be complex, and (2) we do not restrict ourselves to three dimensions 
(indeed, in Section 3.2 we shall be working with vectors that live in spaces of infinite 
dimension). 

3.1.1 Vectors 

A vector space consists of a set of vectors (la), | (5), \y), ...), together with a set 
of scalars (a, b, c, .. .), 2 which are subject to two operations—vector addition and 
scalar multiplication: 


1 If you have already studied linear algebra, you should be able to skim this section quickly, but 
I wouldn’t skip it altogether, because some of the notation may be unfamiliar. If, on the other hand, this 
material is new to you, be warned that I am only summarizing (often without proof) those aspects of the 
theory we will be needing later. For details, you should refer to a text on linear algebra, such as the classic 
by P. R. Halmos: Finite Dimensional Vector Spaces , 2nd ed. (Princeton, NJ: van Nostrand, 1958). 

2 For our purposes, the scalars will be ordinary complex numbers. Mathematicians can tell you 
about vector spaces over more exotic fields, but such objects play no role in quantum mechanics. 
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Vector addition. The “sum” of any two vectors is another vector: 

l«) + 1/5) = Ik). [3.1] 

Vector addition is commutative 

|a> + \P) = \P) + |«), [3.2] 

and associative 

l«) + (l^) + lK))-(l«) + l^)) + lK). [3.3] 

There exists a zero (or null) vector, 3 |0), with the property that 

|of> + |0) = |of>, [3.4] 

for every vector |a). And for every vector \a) there is an associated inverse vector 
(| — a)), such that 

|or> + | — or) = |0). [3.5] 

Scalar multiplication. The “product” of any scalar with any vector is another 
vector: 

a\a) = | y). [3.6] 

Scalar multiplication is distributive with respect to vector addition 

a(\a) + \P))=a\a)+a\P) [3.7] 

and with respect to scalar addition 

(a + b)\a) = a\a) + b\a). [3.8] 

It is also associative with respect to the ordinary multiplication of scalars: 

a(b\a)) = (ab) |a>. [3.9] 

Multiplication by the scalars 0 and 1 has the effect you would expect: 

0|of) — |0); 11 of) — |a). [3.10] 

Evidently | - a) = (—l)|a). 

There’s a lot less here than meets the eye—all I have done is to write down 
in abstract language the familiar rules for manipulating vectors. The virtue of such 
abstraction is that we will be able to apply our knowledge and intuition about the 
behavior of ordinary vectors to other systems that happen to share the same formal 
properties. 


3 It is customary, where no confusion can arise, to write the null vector without the adorning bracket: 
i0) 0. 
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A linear combination of the vectors |«), \{i), |y), ... is an expression of the 

form 

a\a)+b\p)+c\ Y ) + ---. [3.11] 

A vector |A) is said to be linearly independent of the set |a), Jy3), \ y), ... if it cannot 
be written as a linear combination of them. (For example, in three dimensions the unit 
vector k is linearly independent of i and j, but any vector in the Jtv-plane is linearly 
dependent on i and j.) By extension, a set of vectors is linearly independent if each 
one is linearly independent of all the rest. A collection of vectors is said to span the 
space if every vector can be written as a linear combination of the members of this 
set. 4 A set of linearly independent vectors that spans the space is called a basis. The 
number of vectors in any basis is called the dimension of the space. For the moment 
we shall assume that the dimension (n) is finite. 

With respect to a prescribed basis 

Id), Id), Id), [3.12] 

any given vector 

l°0 = a\ 1^!) + a 2 \e 2 } + ■ • • + a n \e n ) [3.13] 

is uniquely represented by the (ordered) n -tuple of its components: 

| ct)<*(a u a 2 ,...,a n ). [3.14] 

It is often easier to work with the components than with the abstract vectors them¬ 
selves. To add vectors, you add their corresponding components: 

\a) + \p) (fli + b u a 2 + b 2 , ..., a n + b n )\ [3.15] 

to multiply by a scalar you multiply each component: 

c\a) (ca u ca 2 , . .., ca„); [3.16] 

the null vector is represented by a string of zeroes: 

|0)**(0,0,...,0); [3.17] 

and the components of the inverse vector have their signs reversed: 

| -a) [3.18] 

The only disadvantage of working with components is that you have to commit your¬ 
self to a particular basis, and the same manipulations will look very different to 
someone working in a different basis. 

Problem 3.1 Consider the ordinary vectors in three dimensions ( a x i + a y j + a z k) 
with complex components. 


4 A set of vectors that spans the space is also called complete, though I personally reserve that word 
for the infinite-dimensional case, where subtle questions of convergence arise. 
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(a) Does the subset of all vectors with a z = 0 constitute a vector space? If so, what 
is its dimension; if not, why not? 

(b) What about the subset of all vectors whose z component is 1? 

(c) How about the subset of vectors whose components are all equal? 

❖Problem 3.2 Consider the collection of all polynomials (with complex coefficients) 
of degree < N in x. 

(a) Does this set constitute a vector space (with the polynomials as “vectors”)? If 
so, suggest a convenient basis, and give the dimension of the space. If not, 
which of the defining properties does it lack? 

(b) What if we require that the polynomials be even functions? 

(C) What if we require that the leading coefficient (i.e., the number multiplying 
x^ -1 ) be 1? 

(d) What if we require that the polynomials have the value 0 at x = 1? 

(e) What if we require that the polynomials have the value 1 at x = 0? 

Problem 3.3 Prove that the components of a vector with respect to a given basis 
are unique. 


3.1.2 Inner Products 


In three dimensions we encounter two kinds of vector products: the dot product and 
the cross product. The latter does not generalize in any natural way to n -dimensional 
vector spaces, but the former does —in this context it is usually called the inner 
product. The inner product of two vectors (|a) and \fi)) is a complex number (which 
we write as {ot\ft)), with the following properties: 


</S| «> = W, 

[3.19] 

{ala) > 0, and {a|a) = 0 O |a) = |0), 

[3.20] 

{a\(b\fi) +c|y)) = b{a\P) +c(a\y). 

[3.21] 


Apart from the generalization to complex numbers, these axioms simply codify the 
familiar behavior of dot products. A vector space with an inner product is called an 

inner product space. 

Because the inner product of any vector with itself is a nonnegative number 
(Equation 3.20), its square root is real —we call this the norm of the vector: 

M = y/ <«!«>; 


[3.22] 
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it generalizes the notion of “length”. A “unit” vector, whose norm is 1, is said to 
be normalized (the word should really be “normal”, but I guess that sounds too 
anthropomorphic). Two vectors whose inner product is zero are called orthogonal 
(generalizing the notion of “perpendicular”). A collection of mutually orthogonal 
normalized vectors, 

(a i \a J )=8 lj , [3.23] 

is called an orthonormal set. It is always possible (see Problem 3.4), and almost 
always convenient, to choose an orthonormal basis', in that case the inner product of 
two vectors can be written very neatly in terms of their components: 

(a\P) = a\b\ + 0^2 A - \~a*b n , [3.24] 

the norm (squared) becomes 

(a|a) = |a 1 | 2 + |fl 2 | 2 + -.- + K| 2 , [3.25] 

and the components themselves are 


a t = <e,|a>. 


[3.26] 


(These results generalize the familiar formulas a • b — a x b x + a y b y + a z b z , a • a 
= al + a 2 y + a 2 , and a x — i • a, a y = j • a, a z = k ■ a, for the three-dimensional 
orthonormal basis i, j, k.) From now on we shall always work in orthonormal bases 
unless it is explicitly indicated otherwise. 

Another geometrical quantity one might wish to generalize is the angle between 
two vectors. In ordinary vector analysis cos 9 = (a • b)/|a| |b|. But because the inner 
product is in general a complex number, the analogous formula (in an arbitrary inner 
product space) does not define a (real) angle 0. Nevertheless, it is still true that the 
absolute value of this quantity is a number no greater than 1, 

ItaliS)! 2 < (<*|o<)W>. [3.27] 


(This important result is known as the Schwarz inequality; the proof is given in 
Problem 3.5.) So you can, if you like, define the angle between |a) and \fi) by the 
formula _ 


cosf? — 


{ot\ot)mv 


[3.28] 


*Problem 3.4 Suppose you start out with a basis {\e\), \e 2 ), ..., \e»)) that is not 
orthonormal. The Gram-Schmidt procedure is a systematic ritual for generating 
from it an orthonormal basis (\e[), \e' 2 ), ..., \e' n )). It goes like this: 

(i) Normalize the first basis vector (divide by its norm): 
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(ii) Find the projection of the second vector along the first, and subtract it off: 

\e 2 ) ~ {e[ WWi) • 

This vector is orthogonal to \e\)\ normalize it to get \e r 2 ). 

(iii) Subtract from |e 3 ) its projections along \e\) and \e' 2 ) : 

|<?3> - <e'ile3)ki) - (4^3)14)- 

This is orthogonal to \e\) and \e' 2 )\ normalize it to get \e^). And so on. 

Use the Gram-Schmidt procedure to orthonormalize the three-space basis 

\e\) = (l+i)i+(l)j + (i)k, M = (i)f+ (3)j + (1)4 k 3 ) = (0)f + (28)y+(0)£. 

Problem 3.5 Prove the Schwarz inequality (Equation 3.27). Hint: Let | y) = 
\p) - ({ct\P)/(ct\a))\ct), and use {y\y) > 0. 

Problem 3.6 Find the an^le (in the sense of Equation 3.28) between the vectors 
| a) = (1 + i)i + (1)7 + (i)k and |/8) = (4 - i)i + (0)y + (2 - 2 i)k. 

Problem 3.7 Prove the triangle inequality: ||(|a) + ]/8))|| < ll«tl + 


3.1.3 Linear Transformations 

Suppose you take every vector (in three-space) and multiply it by 17, or you rotate 
every vector by 39° about the 2 -axis, or you reflect every vector in the xy-plane—these 
are all examples of linear transformations. A linear transformation 5 (T) takes each 
vector in a vector space and “transforms” it into some other vector (|a) \a') = 

f |a)), with the proviso that the operation is linear: 

f(a\a) + b\P)) = a(f\a)) + b(f\0)), [3.29] 

for any vectors |«), \fi) and any scalars a, b. 

If you know what a particular linear transformation does to a set of basis vectors, 
you can easily figure out what it does to any vector. For suppose that 


T\e i) 

= T] i |ei) + T 2 i\e 2 ) + • • 

• + r„,|e„>, 

T\ei) 

= T]2|ei) + T 2 2|e2) + •' 

• + T n2 \e n ), 

T\e n ) 

= T ]n \ei) + T 2 n \e 2 ) F ■ 

• • + T„„\e„), 


or, more compactly, 

f\ e J ) = y2 T ij\e,) ) 0' = l,2,...,n). [3.30] 

1 = 1 


5 In this chapter I’ll use a hat (to denote linear transformations; this is not inconsistent with my 
earlier convention (putting hats on operators), for (as we shall see) our operators are linear transformations. 
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If |a) is an arbitrary vector: 


l«> = «iki) +a 2 \e 2 ) + • ■ -+a n \e n ) - 'Y^a j \e j ), [3.31] 

y=i 

then 

f\ a ) = j2«Af\‘ j)) = 'EY.Wj-m = E(Ei 3 - 32 ! 

7 = 1 7=1 i = l r = l 7 = 1 

Evidently T takes a vector with components a u a 2 ,... ,a„ into a vector with compo¬ 
nents 6 


= E I 3 - 33 ! 

7 = 1 

Thus the n 2 elements 7) y uniquely characterize the linear transformation f (with 
respect to a given basis), just as the n components n, uniquely characterize the vector 
I a) (with respect to the same basis): 

f*+(T u ,T l2 , ...,T nn ). [3.34] 

If the basis is orthonormal, it follows from Equation 3.30 that 

7i 7 = { e ,|Tk y ). [3.35] 

It is convenient to display these complex numbers in the form of a matrix 7 : 

Tn T {2 
T 21 t 22 

T n 1 T n2 

The study of linear transformations, then, reduces to the theory of matrices. The sum 
of two linear transformations (S + T) is defined in the natural way: 

(S+T)\a) = S\a) + f\a)-, [3.37] 

this matches the usual rule for adding matrices (you add their corresponding elements): 

U = S + T &Uij = Sij + Tij. [3.38] 

6 Notice the reversal of indices between Equations 3.30 and 3.33. This is not a typographical error. 
Another way of putting it (switching i j in Equation 3.30) is that if the components transform with T i h 
the basis vectors transform with 7),. 

7 ril use boldface to denote matrices. 
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The product of two linear transformations (ST) is the net effect of performing them 
in succession—first T, then S\ 


|a) |a') = T\a) -► \a") = S\a') = S(T\a )) = ST\a). [3.39] 

What matrix U represents the combined transformation U — ST! It’s not hard to 
work it out: 




(x T j ka k) - X(x ^) ak — X 

\*=l / k= 1 \J =l / k= 1 


UikCik- 


Evidently 

u = st c/,-* = X s u T Jk ; [3 - 40] 

7 = 1 

this is the standard rule for matrix multiplication—to find the ik th element of the 
product, you look at the i th row of S and the k lh column of T, multiply corresponding 
entries, and add. The same procedure allows you to multiply rectangular matrices, as 
long as the number of columns in the first matches the number of rows in the second. 
In particular, if we write the n -tuple of components of |oc) as an n x 1 column matrix 

( a i\ 

as a ! , [3.41] 

\a n ) 

the transformation rule (Equation 3.33) can be written 

a' = Ta. [3-42] 


And now, some useful matrix terminology: The transpose of a matrix (which 
we shall write with a tilde: T) is the same set of elements, but with rows and columns 
interchanged: 

(T n T 2 i ... T nl \ 

T\2 T 22 ... T ,,2 


T = 


J 


\ T,„ T 2 „ ... T„„ 

Notice that the transpose of a column matrix is a row matrix: 


i = (<*i «2 


a n ). 


[3.43] 


[3.44] 


A square matrix is symmetric if it is equal to its transpose (reflection in the main 
diagonal—upper left to lower right—leaves it unchanged); it is antisymmetric if 
this operation reverses the sign: 

SYMMETRIC: T = T; ANTISYMMETRIC: T = -T. [3.45] 
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To construct the (complex) conjugate of a matrix (which we denote, as usual, with 
an asterisk: T*), you take the complex conjugate of every element: 



(V Vi • 

■ V\ 

T* = 

V\ V 2 ■ 

• Vn 


Wi Vi ■ 

■ ■ T*n ' 


a* 

a* 


[3.46] 


A matrix is real if all its elements are real and imaginary if they are all imaginary: 


REAL: T* = T; IMAGINARY: T* = -T. [3.47] 


The Hermitian conjugate (or adjoint) of a matrix (indicated by a dagger: T t ) is the 
transposed conjugate: 


T+ = T* = 


'*n T 2 * 

V V 


■■ V\ 

Vi 

• • T* ) 




a*)- [3.48] 


A square matrix is Hermitian (or self-adjoint) if it is equal to its Hermitian conjugate; 
if Hermitian conjugation introduces a minus sign, the matrix is skew Hermitian (or 
anti-Hermitian): 


HERMITIAN: V = T; SKEW HERMITIAN: V = -T. [3.49] 

With this notation the inner product of two vectors (with respect to an orthonormal 
basis—Equation 3.24), can be written very neatly in matrix form: 

{ct\fi) = a f b. [3.50] 

(Notice that each of the three operations discussed in this paragraph, if applied twice, 
returns you to the original matrix.) 

Matrix multiplication is not, in general, commutative (ST ^ TS); the difference 
between the two orderings is called the commutator: 


[S, T] = ST - TS. 


[3.51] 


The transpose of a product is the product of the transposes in reverse order. 

(ST) = TS [3.52] 

(see Problem 3.12), and the same goes for Hermitian conjugates: 

(ST)t = VsK 


[3.53] 
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The unit matrix (representing a linear transformation that carries every vector 
into itself) consists of ones on the main diagonal and zeroes everywhere else: 

1 0 
0 1 

0 0 

In other words, 

1 U ~ 

The inverse of a matrix (written T -1 ) is defined in the obvious way: 

T -1 T = TT -1 = 1. [3.56] 



::: 


... i/ 

--Sti¬ 


tt-54] 


[3.55] 


A matrix has an inverse if and only if its determinant 8 is nonzero; in fact. 


T -i 


— C, 

detT 


[3.57] 


where C is the matrix of cofactors [the cofactor of element 7},- is (-1)' +J times 
the determinant of the submatrix obtained from T by erasing the / th row and the j th 
column]. A matrix without an inverse is said to be singular. The inverse of a product 
(assuming it exists) is the product of the inverses in reverse order : 

(ST) -1 = T -1 S -1 . [3.58] 


A matrix is unitary if its inverse is equal to its Hermitian conjugate: 

UNITARY : = U -t . [3.59] 


Assuming the basis is orthonormal, the columns of a unitary matrix constitute an 
orthonormal set, and so too do its rows (see Problem 3.16). 

The components of a given vector depend on your (arbitrary) choice of basis, 
as do the elements in the matrix representing a given linear transformation. We might 
inquire how these numbers change when we switch to a different basis. The old basis 
vectors \e t ) are—like all vectors—linear combinations of the new ones: 


\ e \) 

= 5ii|/i) + 52i I/ 2 ) + • • 

• + 5„i|/„), 

\e 2 ) 

= 5 12 |/l) + 5-22 1 / 2 ) + • ■ 

• + 5„ 2 | fn). 


= 5 ln |/l) + S 2 n\h) + ■ 

■ • + S nn \f n ) 


8 I assume you know how to evaluate determinants. If not, see M. Boas, Mathematical Methods in 
the Physical Sciences, 2nd ed. (New York: John Wiley, 1983), Section 3.3. 
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(for some set of complex numbers S,/), or, more compactly, 

l*v> = I>yl/ ; >. 0' = 1 . 2 ,. 


[3.60] 


This is itself a linear transformation (compare Equation 3.30), 9 and we know imme¬ 
diately how the components transform: 

a i = ^ SijOj [3.61] 

J = 1 

(where the superscript indicates the basis). In matrix form 

a / = Sa". [3.62] 

What about the matrix representing a given linear transformation T —how is it 
modified by a change of basis? In the old basis we had (Equation 3.42) 

a c ' = TV, 

and Equation 3.62—multiplying both sides by S" 1 —entails 10 a e = S "W, so 
a f = Sa e/ = S(T e a e ) = ST e S“ V. 


Evidently 

T / =ST e S“ 1 . [3.63] 

In general, two matrices (Ti and T 2 ) are said to be similar if T 2 = ST|S _1 for some 
(nonsingular) matrix S. What we have just found is that similar matrices represent 
the same linear transformation with respect to two different bases. Incidentally, if 
the first basis is orthonormal, the second will also be orthonormal if and only if the 
matrix S is unitary (see Problem 3.14). Since we always work in orthonormal bases, 
we are interested mainly in unitary similarity transformations. 

While the elements of the matrix representing a given linear transformation 
may look very different in the new basis, two numbers associated with the matrix are 
unchanged: the determinant and the trace. For the determinant of a product is the 
product of the determinants, and hence 

det(T / ) = det(S T e S _1 ) = det(S) det(T e ) det(S _1 ) = detT e . [3.64] 


9 Notice, however, the radically different perspective: In this case we’re talking about one and the 
same vector, referred to two different bases, whereas before we were thinking of a completely different 
vector, referred to the same basis. 

10 Note that S -1 certainly exists—if S were singular, the |/, )’s would not span the space, so they 
wouldn’t constitute a basis, 
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And the trace, which is the sum of the diagonal elements, 

m 

Tr(T) = ^7},, [3.65] 

1 = 1 


has the property (see Problem 3.15) that 

Tr(T]T 2 ) = Tr(T 2 T,), [3.66] 

(for any two matrices Ti and T 2 ), so that 

Tr(T / ) = Tr(ST‘ ! S- 1 ) = Tr(T*S _I S) = Tr(T c ). [3.67] 

Problem 3.8 Using the standard basis (f, j, k) for vectors in three dimensions: 

(a) Construct the matrix representing a rotation through angle 9 (counterclockwise, 
looking down the axis toward the origin) about the 2 -axis. 

(b) Construct the matrix representing a rotation by 120° (counterclockwise, looking 
down the axis) about an axis through the point (1,1,1). 

(c) Construct the matrix representing reflection in the xy-plane. 

(d) Are translations (x -* x + x 0 , y -* y + yo, 2^2 + z Ch for some constants 
x 0 , >b, 2 0 ) linear transformations? If so, find the matrix which represents them; 
if not, explain why not. 

❖Problem 3.9 Given the following two matrices: 

/-I 1 i\ f2 0 -i\ 

A =I 2 0 3, B= 0 1 0 , 

\2i -2 i 2/ \i 3 2/ 

compute (a) A + B, (b) AB, (c) [A, B], (d) A, (e) A*, (f) A*, (g) Tr(B), (h) det(B), 
and (i) B _1 . Check that BB -1 = 1. Does A have an inverse? 

❖Problem 3.10 Using the square matrices in Problem 3.9 and the column matrices 

■-(?)• i -(v)' 

find (a) Aa, (b) a^b, (c) aBb, (d) ab^ 

Problem 3.11 By explicit construction of the matrices in question, show that any 
matrix T can be written 


(a) as the sum of a symmetric matrix S and an antisymmetric matrix A; 
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(b) as the sum of a real matrix R and an imaginary matrix I; 

(c) as the sum of a Hermitian matrix H and a skew-Hermitian matrix K. 

^Problem 3.12 Prove Equations 3.52, 3.53, and 3.58. Show that the product of two 
unitary matrices is unitary. Under what conditions is the product of two Hermitian 
matrices Hermitian? Is the sum of two unitary matrices unitary? Is the sum of two 
Hermitian matrices Hermitian? 

Problem 3.13 In the usual basis (i, j, k), construct the matrix T x representing a 
rotation through angle 9 about the x-axis, and the matrix T v representing a rotation 
through angle 9 about the y-axis. Suppose now we change bases, to V = ],]' = 
—i, k — k. Construct the matrix S that effects this change of basis, and check that 
ST X S _1 and ST^S -1 are what you would expect. 

Problem 3.14 Show that similarity preserves matrix multiplication (that is if 
A e B e — C e , then A^B f = C^). Similarity does not, in general, preserve symmetry, 
reality, or Hermiticity; show, however, that if S is unitary , and H e is Hermitian, then 
H / is Hermitian. Show that S carries an orthonormal basis into another orthonormal 
basis if and only if it is unitary. 

^Problem 3.15 Prove that Tr(TiT 2 ) = Tr(T 2 T ( ). It follows immediately that 
Tr(TiT 2 T 3 ) = Tr(T 2 T 3 Ti), but is it the case that Tr(T,T 2 T 3 ) = Tr(T 2 T 1 T 3 ), in gen¬ 
eral? Prove it, or disprove it. Hint: The best disproof is always a counterexample— 
and the simpler the better! 


Problem 3.16 Show that the rows and columns of a unitary matrix constitute 
orthonormal sets. 


3.1.4 Eigenvectors and Eigenvalues 

Consider the linear transformation in three-space consisting of a rotation, about some 
specified axis, by an angle 9. Most vectors will change in a rather complicated way 
(they ride around on a cone about the axis), but vectors that happen to lie along the 
axis have very simple behavior: They don’t change at all (T\a) — |of)). If 9 is 180°, 
then vectors which lie in the the “equatorial” plane reverse signs (f |a) — —la)). In a 
complex vector space, 11 every linear transformation has “special” vectors like these, 
which are transformed into simple multiples of themselves: 

T\a)=X\a)- [3.68] 


11 This is not always true in a real vector space (where the scalars are restricted to real values). See 
Problem 3.17. 
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they are called eigenvectors of the transformation, and the (complex) number A is 
their eigenvalue. (The null vector doesn’t count, even though, in a trivial sense, it 
obeys Equation 3.68 for any T and any X ; technically, an eigenvector is any nonzero 
vector satisfying Equation 3.68.) Notice that any (nonzero) multiple of an eigenvector 
is still an eigenvector with the same eigenvalue. 

With respect to a particular basis, the eigenvector equation assumes the matrix 

form 

Ta = Aa [3.69] 

(for nonzero a), or 

(T - Xl)a = 0. [3.70] 


(Here 0 is the zero matrix, whose elements are all zero.) Now, if the matrix (T — Al) 
had an inverse, we could multiply both sides of Equation 3.70 by (T — Al) -1 , and 
conclude that a = 0. But by assumption a is not zero, so the matrix (T — Al) must 
in fact be singular, which means that its determinant vanishes: 


det(T - XI) = 


(7ii - A.) T n 
T 21 (T 22 — X) 


Tu 

T 2n 


Tn 1 T nl ... (T nn -X) 

Expansion of the determinant yields an algebraic equation for X: 

C n X n T C n —\X n 1 -T • • • + CiA. -j- Cq = 0, 


= 0. [3.71] 


[3.72] 


where the coefficients C, depend on the elements of T (see Problem 3.19). This is 
called the characteristic equation for the matrix; its solutions determine the eigen¬ 
values. Notice that it’s an wth-order equation, so it has n (complex) roots. 12 However, 
some of these may be duplicates, so all we can say for certain is that annx« matrix 
has at least one and at most n distinct eigenvalues. To construct the corresponding 
eigenvectors it is generally easiest simply to plug each X back into Equation 3.69 and 
solve “by hand” for the components of a. I’ll show you how it goes by working out 
an example. 


Example. Find the eigenvalues and eigenvectors of the following matrix: 


/ 2 0 
M = ( -2 i i 

V 1 0 


-2 
2 i 
-1 


)■ 


The characteristic equation is 

(2 - A) 0 -2 

—2 i (/ — X) 2 i 

1 0 (—1 — A) 


—X 3 + (1 + i)X 2 - iX = 0, 


[3.73] 


[3.74] 


12 It is here that the case of real vector spaces becomes more awkward, because the characteristic 
equation need not have any (real) solutions at all. See footnote 11 and Problem 3.17. 
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and its roots are 0, 1, and i. Call the components of the first eigenvector (a\, a 2 , a 3 ); 
then 


2 

-2 i 


0 

i 

0 



which yields three equations: 


2ai — 2 a 3 = 0, 
—2 ia\ + ia 2 2ztz 3 = 0, 

ai — a-x, = 0. 


The first determines a 3 (in terms of a \): a 3 — a\ \ the second determines a 2 \ a 2 = 0; 
and the third is redundant. We may as well pick a\ = 1 (since any multiple of an 
eigenvector is still an eigenvector): 


a (1) = 


for Xi — 0. 


[3.75] 


For the second eigenvector (recycling the same notation for the components) 
we have 


2 

-2 i 


0 

i 

0 




which leads to the equations 


2a\ — 2a 3 = a\, 
—2ia\ + ia 2 + 2/«3 = a 2 , 
a\ -a 3 = a 3 , 


with the solution a 3 = (l/2)ai, a 2 — [(1 — i)/2\a\ \ this time we’ll pick a\ — 2, so 
that 



for k 2 = 1. 


[3.76] 


Finally, for the third eigenvector, 


/ 2 0 -2\ /«A /a i \ /ia x \ 

I —2 i z 2 i } | a 2 } = t ( a 2 1 = ( ia 2 J, 

V 1 0 — 1/ \« 3 / Vfl 3 / \ia 3 / 


which gives the equations 


2ai — 2a 3 = z'«i, 
—2i'«i + za 2 + 2 ia 3 — ia 2 , 
— a 3 = ia 3 , 
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whose solution is = a\ = 0, with a 2 undetermined. Choosings = 1, we conclude 


a (3) = ^ 1 j , for A 3 = i. 


If the eigenvectors span the space (as they do in the preceding example), we 
are free to use them as a basis: 

T\fi) = Wi>, 

T\f 2 ) = A 2 |/ 2 >, 


T\fn) = K\fn). 

The matrix representing T takes on a very simple form in this basis, with the eigen¬ 
values strung out along the main diagonal and all other elements zero: 

/A, 0 ... 0\ 

0 A 2 ... 0 


= 0 


A matrix that can be brought to diagonal form (Equation 3.78) by a change 
of basis is said to be diagonalizable. The similarity matrix that accomplishes the 
transformation can be constructed by using the eigenvectors (in the old basis) as the 
columns of S^ 1 : 

(S _1 )iy = (a w )i. [3.80] 


\0 0 ... K 

The (normalized) eigenvectors are equally simple: 


Example (cont’d). In the example, 


n 2 (K 

S " 1 = 0 (i -0 1 , 
\i 1 0/ 


/ -1 0 2 

s = I 1 0 -1 

Vo'-i) 1 (i-o 


so (using Equation 3.57) 
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and you can check for yourself that 

/0 0 0 \ 
SMS -1 = ( 0 1 0). 
\0 0 i/ 


There is a great advantage in bringing a matrix to diagonal form—it’s much easier to 
work with. Unfortunately, not every matrix can be diagonalized—the eigenvectors 
have to span the space. For an example of a matrix that cannot be diagonalized, see 
Problem 3.18. 


^Problem 3.17 The 2x2 matrix representing a rotation of the xy-plane is 

T _ /cos 0 — sin# \ 

— \ sin 6 cos 9 )' 

Show that (except for certain special angles—what are they?) this matrix has no real 
eigenvalues. (This reflects the geometrical fact that no vector in the plane is carried 
into itself under such a rotation; contrast rotations in three dimensions.) This matrix 
does, however, have complex eigenvalues and eigenvectors. Find them. Construct 
a matrix S which diagonalizes T. Perform the similarity transformation (STS -1 ) 
explicitly, and show that it reduces T to diagonal form. 

Problem 3.18 Find the eigenvalues and eigenvectors of the following matrix: 


M = 


( 


!)■ 


Can this matrix be diagonalized? 

Problem 3.19 Show that the first, second, and last coefficients in the characteristic 
equation (Equation 3.72) are 

C n = (-1)", C„_! = (—l)" -1 Tr(T), and C 0 = det(T). [3.81] 
For a 3 x 3 matrix with elements T tJ , what is Ci? 

Problem 3.20 It is pretty obvious that the trace of a diagonal matrix is the sum of 
its eigenvalues, and its determinant is their product (see Equation 3.78). It follows 
(from Equations 3.64 and 3.67) that the same holds for any diagonalizable matrix. 
Prove that 

det(T) =X l X 2 ---k n , Tr(T) = Aj + A 2 +-b [3.82] 

for any matrix. (The A’s are the n solutions to the characteristic equation—in the case 
of multiple roots, there may be fewer linearly independent eigen vectors than there 
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are solutions, but we still count each A. as many times as it occurs.) Hint: Write the 
characteristic equation in the form 


(A 1 -A)(A 2 -A)---(A n -A)=0, 


and use the result of Problem 3.19. 


3.1.5 Hermitian Transformations 

In Equation 3.48 I defined the Hermitian conjugate (or “adjoint”) of a matrix as its 
transpose conjugate: = T*. Now I want to give you a more fundamental definition 

for the Hermitian conjugate of a linear transformation: It is that transformation T' 
which, when applied to the first member of an inner product, gives the same result as 
if T itself had been applied to the second vector: 

(t'a\P) = (a I Tp) [3.83] 

(for all vectors |a) and |^)). 13 [I have to warn you that although everybody uses it, this 
is lousy notation. For a and /3 are not vectors (the vectors are \a) and \fi)), they are 
labels —serial numbers (“F43A-9GT”), or names (“Charlie”), or bar codes—anything 
you care to use to identify the different vectors. In particular, they are endowed with no 
mathematical properties at all, and the expression ‘T/S” is literally nonsense: linear 
transformations act on vectors, not labels. But it’s pretty clear what the notation 
means: \ Tfi) means T\fi), and (f^a\f3) means the inner product of the vector T^a) 
with the vector |/3). Notice in particular that 

(a\cf) = c{a\f), [3.84] 

but 

{ca\P) =c*{a\f) [3.85] 

for any scalar c .] If you’re working in an orthonormal basis (as we always shall), 
the Hermitian conjugate of a linear transformation is represented by the Hermitian 
conjugate of the corresponding matrix (so the terminology is consistent); for (using 
Equations 3.50 and 3.53), 

(a\fp) = a*Tb = (T f a)*b = [3.86] 

In quantum mechanics, a fundamental role is played by Hermitian transforma¬ 

tions (7^ = f). The eigenvectors and eigenvalues of a Hermitian transformation 
have three crucial properties: 


13 If you’re wondering whether such a transformation necessarily exists, you should have been a 
math major. Still, it’s a good question, and the answer is yes. See, for instance, Halmos, (footnote 1), 
Section 44. 
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1. The eigenvalues of a Hermitian transformation are real. 

Proof: Let X be an eigenvalue of f: T\a) = A.|a), with \a) ^ |0). Then 

(a | fa) = (alAa) = A.{a|of). 

Meanwhile, if T is Hermitian, then 

(a | fa) — (fa\a) = (Xa\a) = A*(a|a). 

But (ala) ^ 0 (Equation 3.20), so X = A.*, and hence X is real. QED 

2. The eigenvectors of a Hermitian transformation belonging to dis¬ 
tinct eigenvalues are orthogonal. 

Proof: Suppose f\a) — A]a) and f \f) — /x|/i), with X ^ g. Then 

(a|f^) = (a | fi0) = ii(a\P), 


and if f is Hermitian, 


(a\m = (fa\P) = (ka\P)=k*(a\fi). 

But X = X* (from property 1), and X / /x, by assumption, so (a\f$) = 0. QED 

3. The eigenvectors of a Hermitian transformation span the space. 

Comment: If all the n roots of the characteristic equation are distinct, then (by 
property 2) we have n mutually orthogonal eigenvectors, so they obviously span the 
space. But what if there are duplicate roots (or, as they are called, in this context, 
degenerate eigenvalues)? Suppose X is m-fold degenerate; any linear combination 
of two eigenvectors belonging to the same eigenvalue is still an eigenvector (with 
the same eigenvalue)—what we must show is that there are m linearly independent 
eigenvectors with eigenvalue X. The proof is given in most books on linear algebra, 14 
and I shall not repeat it here. These eigenvectors can be orthogonalized by the Gram- 
Schmidt procedure (see Problem 3.4), so in fact the eigenvectors of a Hermitian 
transformation can always be taken to constitute an orthonormal basis. It follows, in 
particular, that any Hermitian matrix can be diagonalized by a similarity transfor¬ 
mation, with S unitary. This rather technical result is, in a sense, the mathematical 
support on which much of quantum mechanics leans. As we shall see, it turns out to 
be a thinner reed than one might have hoped. 


l4 I like the treatment in F. W. Byron, Jr., and R. W. Fuller, Mathematics of Classical and Quantum 
Physics (Reading, MA: Addison-Wesley, 1969), Vol. I, Section 4.7. 



Chap. 3 Formalism 


Problem 3.21 A Hermitian linear transformation must satisfy <a| Tp) = {fa\p) 
for all vectors |a) and |/3). Prove that it is (surprisingly) sufficient that {y\fy) = 
{fy\y) for all vectors | y). Suppose you could show that (e n \Te„) = {fe n \e n ) for 
every member of an orthonormal basis. Does it necessarily follow that T is Hermitian? 
Hint : First let | y) = |or) + |/3), and then let | y) = |of) + i\P). 


❖Problem 3.22 Let 


-(.i. 


l -i 
0 


)■ 


(a) Verify that T is Hermitian. 

(b) Find its eigenvalues (note that they are real). 

(c) Find and normalize the eigenvectors (note that they are orthogonal). 

(d) Construct the unitary diagonalizing matrix S, and check explicitly that it diag¬ 
onalizes T. 

(e) Check that det(T) and Tr(T) are the same for T as they are for its diagonalized 
form. 

**Problem 3.23 Consider the following Hermitian matrix: 



(a) Calculate det(T) and Tr(T). 

(b) Find the eigenvalues of T. Check that their sum and product are consistent with 
(a), in the sense of Equation 3.82. Write down the diagonalized version of T. 

(c) Find the eigenvectors of T. Within the degenerate sector, construct two linearly 
independent eigenvectors (it is this step that is always possible for a Hermitian 
matrix, but not for an arbitrary matrix—contrast Problem 3.18). Orthogonalize 
them, and check that both are orthogonal to the third. Normalize all three 
eigenvectors. 

(d) Construct the unitary matrix S that diagonalizes T, and show explicitly that the 
similarity transformation using S reduces T to the appropriate diagonal form. 

Problem 3.24 A unitary linear transformation is one for which UW — 1. 

(a) Show that unitary transformations preserve inner products, in the sense that 
{Ua\Ufi) = (of|^), for all vectors (a), |/3). 
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(b) Show that the eigenvalues of a unitary transformation have modulus 1. 

(c) Show that the eigenvectors of a unitary transformation belonging to distinct 
eigenvalues are orthogonal. 


3.2 FUNCTION SPACES _ 

We are ready now to apply the machinery of linear algebra to the interesting and 
important case of function spaces, in which the “vectors” are (complex) functions of 
x, inner products are integrals, and derivatives appear as linear transformations. 

3.2.1 Functions as Vectors 

Do functions really behave as vectors? Well, is the sum of two functions a function? 
Sure. Is addition of functions commutative and associative? Indeed. Is there a “null” 
function? Yes: f{x) = 0. If you multiply a function by a complex number, do you 
get another function? Of course. Now, the set of all functions is a bit unwieldy—we’ll 
be concerned with special classes of functions, such as the set of all polynomials of 
degree < N (Problem 3.2), or the set of all odd functions that go to zero at x — 1, or 
the set of all periodic functions with period n . Of course, when you start imposing 
conditions like this, you’ve got to make sure that you still meet the requirements for 
a vector space. For example, the set of all functions whose maximum value is 3 
would not constitute a vector space (multiplication by 2 would give you functions 
with maximum value 6, which are outside the space). 

The inner product of two functions [f{x) and g(x)\ is defined by the integral 


</i«> = J m'g(x) dx 


[3.87] 


(the limits will depend on the domain of the functions in question). You can check 
for yourself that it satisfies the three conditions (Equations 3.19, 3.20, and 3.21) for 
an inner product. Of course, this integral may not converge, so if we want a function 
space with an inner product, we must restrict the class of functions so as to ensure that 
{ f\g ) is always well defined. It is clearly necessary that every admissible function 
be square integrable: 

j \f(x)\ 2 dx <oc [3.88] 

(otherwise the inner product of / with itself wouldn’t even exist). As it turns out, 
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Table 3.1: The first few Legendre polynomials, P n (x). 


^0 = 1 
Pl=x 

P 2 = \{ 3x 2 -1) 

Pi = ^(5x 3 -3x) 

P A = |(35x 4 — 30x 2 + 3) 

P 5 = i(63x 5 - 70x 3 + 15x) 


this restriction is also sufficient —if / and g are both square integrable, then the 
integral in Equation 3.87 is necessarily finite . 15 

For example, consider the set P(N) of all polynomials of degree < N: 

p(x) = ao + a\x + a 2 x 2 H- 1 - a N -\x N ~ l , [3.89] 

on the interval —1 < x < 1. They are certainly square integrable, so this is a bona 
fide inner product space. An obvious basis is the set of powers of x : 

|ci> = 1. \e 2 )=x, \ei) = x 2 , .... \e N )=x N ~'- [3.90] 

evidently it’s an iV-dimensinal vector space. This is not, however, an orthonormal 
basis, for 

(e\\e\) = J ldx = 2, (^ 1 ^ 3 ) = j x 2 dx = 2/3, 

and so on. If you apply the Gram-Schmidt procedure, to orthonormalize this ba¬ 
sis (Problem 3.25), you get the famous Legendre polynomials, P n (x) (except that 
Legendre, who had other things on his mind, didn’t normalize them properly): 

= (n = l,2,...,JV). [3.91] 

In Table 3.11 have listed the first few Legendre polynomials. 

^Problem 3.25 Orthonormalize the powers of x, on the interval — 1 < x < 1 , to 
obtain the first four Legendre polynomials (Equation 3.91). 

^Problem 3.26 Let T ( N ) be the set of all trigonometric functions of the form 

N-\ 

f{x) = sin(n 7 TJc) + b n cos(n 7 rx)], [3.921 


l5 There is a quick phoney “proof’ of this, based on the Schwarz inequality (Equation 3.27). The 
trouble is, we assumed the existence of the inner product in proving the Schwarz inequality (Problem 3.5), 
so the logic is circular. For a legitimate proof, see F. Riesz and B. Sz.-Nagy, Functional Analysis (New 
York: Unger, 1955), Section 21. 
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on the interval — 1 < x < 1. Show that 


\e n ) =—e in7TX , (w =0,±1,...,±(W-1)) [3.93] 

V2 

constitutes an orthonormal basis. What is the dimension of this space? 

Problem 3.27 Consider the set of all functions of the form p(x)e~ x2/2 , where p(x) 
is again a polynomial of degree < N in x, on the interval —oc < x < oo. Check that 
they constitute an inner product space. The “natural” basis is 

\e\) = <r x2/2 , \e 2 ) = xe~ x2/1 , |e 3 ) = x 2 e~ x2/2 , ..., \e N ) = x N ~ l e~ x2/2 . 

Orthonormalize the first four of these, and comment on the result. 


3.2.2 Operators as Linear Transformations 

In function spaces operators (such as d/dx, d 2 /dx 2 , or simply x) behave as linear 
transformations, provided that they carry functions in the space into other functions 
in the space and satisfy the linearity condition (Equation 3.29). For example, in the 
polynomial space P(N ) the derivative operator (D = d/dx ) is a linear transforma¬ 
tion, but the operator x (multiplication by x) 16 is not, for it takes (N — l)th-order 
polynomials into A/th-order polynomials, which are no longer in the space. 

In a function space, the eigenvectors of an operator f are called eigenfunctions: 

Tf{x) = Xf{x). [3.94] 

For example, the eigenfunctions of D are 

f x {x) = Ae Xx . [3.95] 

Evidently this operator has only one eigenfunction (the one with X = 0) in the space 

P{ N). 

A Hermitian operator is one that satisfies the defining condition (Equation 3.83): 

(f\fg) = {ff\g), [3.96] 

for all functions f(x) and g(x) in the space. Is the derivative operator Hermitian? 
Well, using integration by parts, we get 

(f\Dg) =ff'^dx = (rg)t - f ~gdx = (fg) \ b a - (Df\g). [3.97] 


16 For consistency, I’ll put a hat on a- when I’m emphasizing its role as an operator, but you’re 
welcome to ignore it if you think I’m being too fastidious. 
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It’s close, but the sign is wrong, and there’s an unwanted boundary term. The sign is 
easily disposed of: D itself is (except for the boundary term) skew Hermitian, so i D 
would be Hermitian—complex conjugation of the i compensates for the minus sign 
coming from integration by parts. As for the boundary term, it will go away if we 
restrict ourselves to functions which have the same value at the two ends: 

f(b) = f(a). [3.98] 

In practice, we shall almost always be working on the infinite interval (a = — oo, b = 
+oo), where square integrability (Equation 3.88) guarantees that f(a) = f(b) = 0, 
and hence that i D is Hermitian. But i D is not Hermitian in the polynomial space 
P(N). 

By now you will realize that when dealing with operators you must always keep 
in mind the function space you’re working in—an innocent-looking operator may not 
be a legitimate linear transformation, because it carries functions out of the space; 
the eigenfunctions of an operator may not reside in the space; and an operator that’s 
Hermitian in one space may not be Hermitian in another. However, these are relatively 
harmless problems—they can startle you, if you’re not expecting them, but they don’t 
bite. A much more dangerous snake is lurking here, but it only inhabits vector spaces 
of infinite dimension. I noted a moment ago that x is not a linear transformation in 
the space P(N) (multiplication by x increases the order of the polynomial and hence 
takes functions outside the space). However, it is a linear transformation on P(o o), 
the space of all polynomials on the interval — 1 < x < 1. In fact, it’s a Hermitian 
transformation, since (obviously) 

j [f(x)T[xg(x)]dx = j [x/(x)]*[g(x)]c/x. 

But what are its eigenfunctions? Well, 


x(a Q + a\x + a 2 X 2 H-) = A(ao + a\x + a 2 x 2 + ■•■), 

for all x, means 

0 — A.(2q, 
ao — A ,a \, 
a\ — Aa 2 , 

and so on. If A = 0, then all the components are zero, and that’s not a legal eigenvector; 
but if A / 0, the first equation says ao = 0, so the second gives a\ = 0, and the third 
says ai = 0, and so on, and we’re back in the same bind. This Hermitian operator 
doesn’t have a complete set of eigenfunctions—in fact it doesn’t have any at all! Not, 
at any rate, in P(o o). 

What would an eigenfunction of x look like? If 


xg(x) = Ag(x), 


[3.99] 
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where X, remember, is a constant, then everywhere except at the one point x = X we 
must have g(x) = 0. Evidently the eigenfunctions of x are Dirac delta functions: 

gx(x) = B8(x - X), [3.100] 

and since delta functions are certainly not polynomials, it is no wonder that the 
operator x has no eigenfunctions in P(oo). 

The moral of the story is that whereas the first two theorems in Section 3.1.5 
are completely general (the eigenvalues of a Hermitian operator are real, and the 
eigenvectors belonging to different eigenvalues are orthogonal), the third one (com¬ 
pleteness of the eigenvectors) is valid (in general) only for finite-dimensional spaces. 
In infinite-dimensional spaces some Hermitian operators have complete sets of eigen¬ 
vectors (see Problem 3.32d for an example), some have incomplete sets, and some (as 
we just saw) have no eigenvectors (in the space) at all. 17 Unfortunately, the complete¬ 
ness property is absolutely essential in quantum mechanical applications. In Section 
3.3 I’ll show you how we manage this problem. 

Problem 3.28 Show that exp(-x 2 /2) is an eigenfunction of the operator Q = 
{d 2 fdx 2 ) — x 2 , and find its eigenvalue. 

❖Problem 3.29 

(a) Construct the matrix D representing the derivative operator D = d/dx with 
respect to the (nonorthonormal) basis (Equation 3.90) in P{N). 

(b) Construct the matrix representing D with respect to the (orthonormal) basis 
(Equation 3.93) in the space T(N) of Problem 3.26. 

(c) Construct the matrix X representing the operator x = x with respect to the basis 
(Equation 3.90) in P( oo). If this is a Hermitian operator (and it is), how come 
the matrix is not equal to its transpose conjugate? 

❖❖Problem 3.30 Construct the matrices D and X in the (orthonormal) basis (Equa¬ 
tion 3.91) for P(o o). You will need to use two recursion formulas for Legendre 
polynomials: 


xP„(x) = 


1 


(2n + 1) 


[(« + l)P„+i(x)3-nP n ^(x)]: 


dPn 

dx 


= J^(2« -4k- \)P n - 


■i(*)> 


[3.101] 

[3.102] 


17 In an n-dimensional vector space, every linear transformation can be represented (with respect 
to a particular basis) by an n x n matrix, and as long as n is finite, the characteristic Equation 3.71 is 
guaranteed to deliver at least one eigenvalue. But if n is infinite , we can’t take the determinant, there is no 
characteristic equation, and hence there is no assurance that even a single eigenvector exists. 
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where the sum cuts off at the first term with a negative index. Confirm that X is 
Hermitian but i D is not. 

Problem 3.31 Consider the operator/) 2 = d 2 jdx 1 . Under what conditions (on the 
admissable functions) is it a Hermitian operator? Construct the matrix representing 
D 1 in P(N) (with respect to the basis Equation 3.90), and confirm that it is the square 
of the matrix representing D (Problem 3.29a). 

Problem 3.32 

(a) Show that iD is Hermitian in the space T(N) of Problem 3.26. 

(b) What are its eigenvalues and (normalized) eigenfunctions, in T ( N)1 

(c) Check that your results in (b) satisfy the three theorems in Section 3.1.5. 

(d) Confirm that iD has a complete set of eigenfunctions in T (oo) (quote the perti¬ 
nent theorem from Fourier analysis). 


3.2.3 Hilbert Space 


To construct the real number system, mathematicians typically begin with the integers , 
and use them to define the rationals (ratios of integers). They proceed to show that the 
rational numbers are “dense,” in the sense that between any two of them (no matter 
how close together they are) you can always find another one (in fact, infinitely many 
of them). And yet, the set of all rational numbers has “gaps” in it, for you can easily 
think of infinite sequences of rational numbers whose limit is not a rational number. 
For example, 


i 111 1 

_ 2 + 3 4 + "' ± N 


[3.103] 


is a rational number for any finite integer N, but its limit (as N oo) is In 2, which 
is not a rational number. So the final step in constructing the real numbers is to “fill in 
the gaps”, or “complete” the set, by including the limits of all convergent sequences 
of rational numbers. (Of course, some sequences don’t have limits, and those we do 
not include. For example, if you change the minus signs in Equation 3.103 to plus 
signs, the sequence does not converge, and it doesn’t correspond to any real number.) 

The same thing happens with function spaces. For example, the set of all 
polynomials, P(o o), includes functions of the form 


/*(*) = l+x + ~ + - + - + ■ 


2 ' 3! + 4! 

(for finite N), but it does not include the limit as N oo: 

v 2 v 3 oo n 

A XX x —v X v 

i +, +T+ - + ...=j]-= e ». 


+ m 


n= 0 


[3.104] 


[3.105] 
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For e x is not itself a polynomial, although it is the limit of a sequence of polynomials. 
To complete the space, we would like to include all such functions. Of course, some 
sequences of polynomials don’t have limits, or have them only for a restricted range 
of x. For example, the series 

9 i 1 

1 +x + x H-- -- 

1 — X 

converges only for |jc| < 1. And even if the sequence does have a limit, the limit 
function may not be square integrable, so we can’t include it in an inner product space. 
To complete the space, then, we throw in all square-integrable convergent sequences 
of functions in the space. Notice that completing a space does not involve the intro¬ 
duction of any new basis vectors; it is just that we now allow linear combinations 
involving an infinite number of terms, 

\a) = ^aj\ej), [3.106] 

j =i 

provided (a\ct) is finite—which is to say (if the basis is orthonormal), provided 

OO 

£>/<». [ 3 . 107 ] 

i=i 

A complete 18 inner product space is called a Hilbert space. 19 The completion 
of P(oc) is easy to characterize: It is nothing less than the set of all square-integrable 
functions on the interval — 1 < x < +1; we call it L 2 {—\, +1). More generally, 
the set of all square-integrable functions on the interval a < x < b is L 2 (a, b). We 
shall be concerned primarily with the Hilbert space L 2 {—oo, +oo) (or L 2 , for short), 
because this is where quantum mechanical wave functions live. Indeed, to physicists 
L 2 is practically synonymous with “Hilbert space”. 

The eigenfunctions of the Hermitian operators ib = id/dx and x = x are of 
particular importance. As we have already found (Equations 3.95 and 3.100), they 
take the form 


fk(x) = A x e lkx , and g k (x) = B k 6{x - X), 

respectively. Note that there is no restriction on the eigenvalues —every real number 
is an eigenvalue of ib, and every real number is an eigenvalue of x. The set of 
all eigenvalues of a given operator is called its spectrum; i D and x are operators 
with continuous spectra, in contrast to the discrete spectra we have encountered 


l8 Note the two entirely different uses of the word “complete”: a set of vectors is complete if it spans 
the space; an inner product space is complete if it has no “holes” in it (i.e., it includes all its limits). 

19 Every finite-dimensional inner product space is trivially complete, so they’re all technically Hilbert 
spaces, but the term is usually reserved for in/zm'/e-dimensional spaces. 
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heretofore. Unfortunately, these eigenfunctions do not lie in Hilbert space , and hence, 
in the strictest sense, do not count as vectors at all. For neither of them is square- 
integrable: 


/: 


7 . 


fx(x)* fx(x)dx = \A k | 2 / e lkx e lkx dx = \A k | 2 / 1 dx -» oo, 


£ 


and 


j &(*)*&«<* = m 2 J 8(x-X)8(x-X)dx = \B,.\ 2 8(X-X)-* oo. 
Nevertheless, they do satisfy a kind of orthogonality condition: 

/ OO fOO 

fx(x)* fjx(x) dx = A* X A^ j e lkx e~ l,xx dx = \Ax\ 2 2jt8(X - p) 

(see Equation 2.126), and 

/ OO fOO 

gx(x)*g^(x) dx = BlB^ J 8(x - X)8(x -p)dx = \B x \ 2 S(X - p). 


It is customary to “normalize” these (unnormalizable) functions by picking the con¬ 
stant so as to leave an unadorned Dirac delta function on the right side (replacing the 
Kronecker delta in the usual orthonormality condition; Equation 3.23). 20 Thus 

fx(x) = -J=e~ iXx , with = 8(1 - p), [3.108] 

are the “normalized” eigenfunctions of iD, and 

&(*) = ~ *■), with (gklgp) = 8(X - p), [3.109] 

are the “normalized” eigenfunctions of x . 21 

What if we use the “normalized” eigenfunctions of i D and x as bases for L 2? 22 
Because the spectrum is continuous, the linear combination becomes an integral: 

/ oo poo 

ax\fx)dX-, | f) = J b x \g\)dX. [3.110] 


20 ril call this “normalization” (in quotes) so you won’t confuse it with the real thing. 

21 We are engaged here in a dangerous stretching of the rules, pioneered by Dirac (who had a kind 
of inspired confidence that he could get away with it) and disparaged by von Neumann (who was more 
sensitive to mathematical niceties), in their rival classics (P. A. M. Dirac, The Principles of Quantum 
Mechanics , first published in 1930, 4 th ed., Oxford (Clarendon Press) 1958, and J. von Neumann, The 
Mathematical Foundations of Quantum Mechanics, first published in 1932, revised by Princeton Univ. 
Press, 1955). Dirac notation invites us to apply the language and methods of linear algebra to functions 
that lie in the “almost normalizable” suburbs of Hilbert space. It turns out to be powerful and effective 
beyond any reasonable expectation. 

22 That’s right: We’re going to use, as bases, sets of functions none of which is actually in the space! 
They may not be normalizable, but they are complete, and that’s all we need. 
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Taking the inner product with |/ M ), and exploiting the “orthonormality” of the basis 
(Equation 3.108), we obtain the “components” a A \ 


a x(fu.\fx) dk = 


axS(fx — A) dX 


oo ^ 

Ox = (A\f) = -2= jj^f(x)dx = F(-X); [3.111] 

evidently the —A “component” of the vector | /), in the basis of eigenfunctions of i D, 
is the Fourier transform (Equation 2.85) of the function fix). Likewise, 

/ OC 

S(x — X)f(x)dx = /(A), [3.112] 

so the A “component” of the vector \f) in the position basis is /(A) itself. [If 
that sounds like double-talk, remember that \f) is an abstract vector, which can be 
expressed with respect to any basis you like; in this sense the function f(x) is merely 
the collection of its “components” in the particular basis consisting of eigenvectors of 
the position operator.] Meanwhile, we can no longer represent operators by matrices 
because the basis vectors are labeled by a nondenumerable index. Nevertheless, we 
are still interested in quantities of the form 

which, by force of habit, we shall call the A, p matrix element of the operator T. 

Problem 3.33 


(a) Show that any linear combination of two functions in L 2 (a, i>) is still in Z. 2 («, b). 
If this weren’t true, of course, £ 2 ( 0 , b) wouldn’t be a vector space at all. 

(b) For what range of (real) v is the function f(x) — \x\ v in L 2 {- 1 , +1)? 

(c) For what range of a is the function fix) — 1— x+x 2 — x 3 + - • -inL 2 i—a, +a)l 

(d) Show that the function fix) = e |jr| is in L 2 , and find its “components” in the 
basis (Equation 3.108). 

(e) Find the matrix elements of the operator D 2 with respect to the basis (Equation 

3.108) of 12- 

Problem 3.34 L 2 { — 1 , +l)includesJzxconn>iwoMxfunctions[suchasthestepfunc- 
tion, 0(x), Equation [2.125], which are not differentiable. But functions expressible 

as Taylor series (f(x) = a Q + a { x + a 2 x 2 H-) must be infinitely differentiable. 

How, then, can 6(x) be the limit of a sequence of polynomials? Note: This is not a 
difficult problem, once you see the light, but it is very subtle, so don’t waste a lot of 
time on it if you’re not getting anywhere. 
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3.3 THE GENERALIZED STATISTICAL 
INTERPRETATION 


My next project is to recast the fundamental principles of quantum mechanics (as 
developed in Chapters 1 and 2) into the more elegant language of linear algebra. 
Remember that the state of a particle is represented by its wave function, ^(x, t), 
whose absolute square is the probability density for finding the particle at point x, at 
time t. It follows that *P must be normalized , which is possible (by dividing off a 
constant) if and only if it is square integrable. Thus 

1. The state of a particle is represented by a normalized vector (| *P)) in the 
Hilbert space Lj. 


Classical dynamical quantities (such as position, velocity, momentum and ki¬ 
netic energy) can be expressed as functions of the “canonical” variables x and p 
(and, in rare cases, t): Q(x, p, t). To each such classical observable we associate a 
quantum-mechanical operator, Q, obtained from Q by the substitution 


P 


n d 

i dx 


The expectation value of Q, in the state T', is 


[3.113] 


( 2 ) = / 


4>(x, t)*Qty(x, t)dx. 


which we now write as an inner product: 23 

{Q) = mQ^). 


[3.114] 


Now, the expectation value of an observable quantity has got to be a real number 
(after all, it corresponds to actual measurements in the laboratory, using rulers and 
clocks and meters), so 


(XPI0VP) = <vp|£vp)* = (gxpjvp), [3.115] 

for all vectors | *P). It follows (see Problem 3.21) that Q must be a Hermitian operator. 
Thus 


2. Observable quantities, Q(x, p, /), are represented by Hermitian opera¬ 
tors, Q(x , the expectation value of Q , in the state I'P), is {'P| Q'F). 


23 The “lousy notation” I warned you about on page 92 is not so bad in this context, for we are using 
the function ^ itself to label the vector j^), and the expression Q'V is perfectly self-explanatory. 
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In general, identical measurements on identically prepared systems (all in the 
same state 'I') do not yield reproducible results; however, some states are determi¬ 
nate, for a particular observable, in the sense that they always give the same result. 
[A competent measurement of the total energy of a particle in the ground state of 
the harmonic oscillator, for example, will always return the value (1/2 )hco.] For a 
determinate state of observable Q, the standard deviation is zero: 

0 = 4 = ((Q- m 2 ) = wee - (fi)) 2 *) 

= m-[Q)m(Q-(Q))*) = ll(G- (e»l*)ll 2 - [3.H6] 

[I used the fact that the operator {Q — (Q)) is Hermitian to peel it off the second 
member of the inner product and attach it to the first member.] But the only vector 
with norm zero is the null vector (Equation 3.20), so (Q — (0)| V I / ) = 0, or 

ei'i<) = <e)i'i')- [3.ii7] 

Evidently determinate states are eigenvectors of Q. Thus 

3. A measurement of the observable Q on a particle in the state |T0 is 
certain to return the value X if and only if l^} is an eigenvector of O, with 
eigenvalue X. 

For example, the time-independent Schrodinger equation (Equation 2.11), 

Hf = Exir, 


is nothing but an eigenvalue equation for the Hamiltonian operator, and the solutions 
are states of determinate energy (as we noted long ago). 

Up to this point I have added nothing new to the statistical interpretation; I 
have merely explored its implications in the language of linear algebra. But there is 
a missing part to the story: Although we can calculate the average result of any mea¬ 
surement, we still cannot say what the probability of getting a particular result would 
be if we were to measure a given observable Q on a particle in an arbitrary state |T*) 
(except for the special case of position for which the original statistical interpretation 
supplies the answer). To finish the job, we need the following generalized statistical 
interpretation, which is inspired by postulate 3 above, and subsumes it as a special 
case: 


3 . If you measure an observable O on a particle in the state | T), you 
are certain to get one of the eigenvalues of Q. The probability of getting the 
particular eigenvalue X is equal to the absolute square of the X component of 
|'l'), when expressed in the orthonormal basis of eigenvectors. 24 


24 Notice that we could calculate from this the expectation value of Q, and it is important to check 
that the result is consistent with postulate 2 above. See Problem 3.35(c). 



106 Chap. 3 Formalism 


To sustain this postulate, it is essential that the eigenfunctions of Q span the 
space. As we have seen, in the^m/te-dimensional case the eigenvectors of a Hermi- 
tian operator always span the space. But this theorem fails in the infinite-dimensional 
case—we have encountered examples of Hermitian operators that have no eigenfunc¬ 
tions at all, or for which the eigenfunctions lie outside the Hilbert space. We shall 
take it as a restriction on the subset of Hermitian operators that are observable, that 
their eigenfunctions constitute a complete set (though they need not fall inside L 2 ). 2 ' 

Now, there are two kinds of eigenvectors, which we need to treat separately. If 
the spectrum is discrete (with the distinct eigenvalues separated by finite gaps), we 
can label the eigenvectors with an integer n: 

Q\e n ) = X n \e n ), with n = 1,2,3, ...; [3.118] 

the eigenvectors are orthonormal (or rather, they can always be chosen so): 

(e„\e m }=8 nm ; [3.119] 

the completeness relation takes the form of a sum: 

OO 

W = [3.120] 

n = 1 

the components are given by “Fourier’s trick”: 

c„ = <£„|'I'}, [3.121] 

and the probability of getting the particular eigenvalue X n is 


\c„\ 2 = ite,mi 2 . 


[3.122] 


On the other hand, if the spectrum is continuous , the eigenvectors are labeled 
by a continuous variable (k): 

Q\eu) = Xk\eu), with - 00 < k < 00 ; [3.123] 

the eigenfunctions are not normalizable (so they do not lie in L 2 , and do not themselves 
represent possible particle states), but they satisfy a sort of “orthonormality” condition 

{e k \e l )=S(k-l) [3.124] 


25 Some authors, following Dirac, take this to be an axiom of quantum mechanics, but it seems to 
me peculiar to use that term for something that is provable in many particular instances; I prefer to regard 
it as a part of what it means to be “observable”. 




Sec. 3.3: The Generalized Statistical Interpretation 107 


(or rather, they can always be chosen so); the completeness relation takes the form of 
an integral: 


m = 


/: 


c k \e k )dk\ 


[3.125] 


the “components” are given by “Fourier’s trick”: 


c k = (e k \^). 


[3.126] 


and the probability of getting an eigenvalue in the range dk about X k is 


\c k \ 1 dk = \(e„m :i dk. 


[3.127] 


The generalized statistical interpretation makes no reference to the observable 
v; it treats all observables on an equal footing. But it includes the “original” form 
(Equation 1.3) as a special case. The “orthonormal” eigenfunctions of the position 
operator are 

e x fx) = <5(jc — x '), [3.128] 


and the eigenvalue (*') can take on any value between —oo and -f-oo. The x' “com¬ 
ponent” of |) is 


/ OO 

S(x - x r )ty(x, t) dx = ty(x',t), 

-OO 


[3.129] 


so the probability of finding the particle in the range dx f about x' is 

\cA 2 dx' = yU(x’,t)\ 2 dx, [3.130] 


which is the original statistical interpretation of 

A more illuminating example is provided by the momentum operator. Its “or- 
thonormal” eigenfunctions are (see Problem 3.37) 

e p {x) = -f=e , '>^\ [3.131] 

y/lTZh 


and the eigenvalue ( p) can take on any value in the range — oo < p < oo. The p 
“component” of 14/) is 


c P = (e p |*) = 


V2 Tlh 


= f 

\tz Ti J- 


e ,px/n ^(x, t) dx = <${p, t). 


[3.132] 


We call <£>(/?, t) the momentum-space wave function—it is (apart from the factors 
of Ti) the Fourier transform of the “position-space” wave function (x, t). Evidently 
the probability of getting a momentum in the range dp is 


mp,t)\ 2 dp. 


[3.133] 
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❖Problem 3.35 

(a) Show that ^ \c n \ z = 1, in Equation 3.120. 

(b) Show that / \ck\ 2 dk = 1, in Equation 3.125. 

(c) From postulate 3' it follows that 

(Q) = £>"l c "l 2 ’ or <2> = / [3.134] 

for discrete and continuous spectra, respectively. Show that this is consistent 
with postulate 2: ( Q ) — ('I'jg'P). 


❖Problem 3.36 

(a) Refer to Problem 2.6. If you measured the energy of this particle, what values 
might you get, and what is the probability of each? Use the answer to calculate 
the expectation value of H , and compare the answer you got before. 

(b) Do the same for Problem 2.8. Hint : To sum the series, look in a math table 
under “Sums of Reciprocal Powers” or “Riemann Zeta Function.” 


Problem 3.3 7 Confirm that e p (x) (in Equation 3.131) is the “orthonormal” eigen¬ 
function of the momentum operator, with eigenvalue p. 

Problem 3.38 Find the momentum-space wave function, <t>(/?, f), for a particle in 
the ground state of the harmonic oscillator. What is the probability (to two significant 
digits) that a measurement of p on a particle in this state would yield a value outside 
the classical range (for the same energy)? Hint. Look in a math table under “Normal 
Distribution” or “Error Function” for the numerical part. 


3.4 THE UNCERTAINTY PRINCIPLE 


I stated the uncertainty principle (in the form a x a p >Ti /2) back in Section 1.6, and 
you have checked it several times in the problems. But we have never actually proved 
it. In this section I shall prove a more general version of the uncertainty principle and 
explore some of its implications. The argument is beautiful, but rather abstract, so 
watch closely. 

3.4.1 Proof of the Generalized Uncertainty Principle 

For any observable A, we have (quoting Equation 3.116) 

= <(A - <A»«K|(A - <A))H/> = </|/>, 
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where |/) = (A — (z4))|4>). Likewise, for any other observable B, 

<*1 = <g|g>. where |g) = (B - (B))\V). 

Therefore (invoking the Schwarz inequality. Equation 3.27), 

°Wb = (/l/Xglg) > l(/lg)| 2 . [3.135] 

Now, for any complex number z, 

|z| 2 = (Re(z)) 2 + (Im(z)) 2 > (Im(z)) 2 = [L(z - z*)] 2 . [3.136] 

Therefore, letting z = {f\g}, 

°Wb > (^[(/lg> - <gl/>]) • [3.137] 

But 


(f\g) = ((A - (A))V\(B - (B)m = (*\(A - (A))(B - (B))V) 

= (V\(AB - A(B) - B(A) + (A)(B)yV) 

= - (5)<'I'|i'T) - ( J 4)<vl/|i?'k} + <z4)< J 8){'k|'k) 

= (AB) - (B)(A) - (A)(B) + (A)(B) 

= (AB)-(A)(B). 

Similarly, 

(g\f) = (BA)~(A)(B), 
so 

(f\g) ~ (g\f) = (AB) - (BA) = ([A, B]), 

where 

[A, B] = AB — BA [3.138] 

is the commutator of the two operators. Conclusion: 



[3.139] 


This is the uncertainty principle in its most general form. (You might think the i 
makes it trivial—isn’t the right side negative ? No, for the commutator carries its own 
factor of i , and the two cancel out.) 

For example, suppose the first observable is position (A = x), and the second 
is momentum (B = ( h/i)d/dx ). To determine the commutator, we use an arbitrary 
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“test function”, fix): 


[x, p]f(x) 


h d , h d , „ 
x t if) - t ~r( x f) 

i ax i ax 


h 

i 


df 

x — 
dx 


( / + 4 ) ]= , '* / - 


[x, p ] = ih. 


Accordingly, 


=G) 


or, since standard deviations are by their nature positive, 

Ti 


[3.140] 


[3.141] 


That proves the original Heisenberg uncertainty principle, but we now see that 
it is just one application of a far more general theorem: There will be an “uncertainty 
principle” for any pair of observables whose corresponding operators do not com¬ 
mute. We call them incompatible observables. Evidently, incompatible observables 
do not have shared eigenvectors—at least, they cannot have a complete set of common 
eigenvectors. Matrices representing incompatible observables cannot be simultane¬ 
ously diagonalized (that is, they cannot both be brought to diagonal form by the 
same similarity transformation). On the other hand, compatible observables (whose 
operators do commute) share a complete set of eigenvectors, and the corresponding 
matrices can be simultaneously diagonalized (see Problem 3.40). 


❖Problem 3.39 Prove the famous “(your name) uncertainty principle,” relating the 
uncertainty in position (ri = x) to the uncertainty in energy (5 = p 2 /2m + V): 


OxGh > — I (/>)!■ 


For stationary states this doesn’t tell you much—why not? 


Problem 3.40 Prove the following: 


(a) If two matrices commute ([A, B] = 0), and you apply the same similarity 
transformation to both of them (A' = SAS -1 , B' = SBS -1 ), the resulting 
matrices also commute ([A', B'] = 0). 

(b) Diagonal matrices always commute. (It follows from this that simultaneously 
diagonalizable matrices must commute. Conversely, if two Hermitian matrices 
commute, then they are simulatneously diagonalizable—i.e., they have a com¬ 
plete set of common eigenvectors. This is not so easy to prove 26 unless you 
happen to know that the spectrum of one of them is nondegenerate.) 


26 See Byron and Fuller (footnote 14), Theorem 4.22. 



Sec. 3.4: The Uncertainty Principle 111 


(c) If matrices A and B commute, and |a) is an eigenvector of A, and the spectrum 
of A is nondegenerate, then |a} is also an eigenvector of B. (In that case the 
matrix S that diagonalizes A also diagonalizes B.) 

♦Problem 3.41 

(a) Prove the following commutator identity: 

[AB,C] = A[B,C] + [A,C]B. [3.142] 

(b) Using Equations 3.140 and 3.142, show that 

[. x n , p] = ihnx n ~ ] . 

(c) For any function /(*) that can be expanded in a power series, show that 

[f(x),p] = ihf'(x), 
where the prime denotes differentiation. 


3.4.2 The Minimum-Uncertainty Wave Packet 

We have twice encountered wave functions that hit the position-momentum uncer¬ 
tainty limit (a x a p = Ti /2): the ground state of the harmonic oscillator (Problem 2.14) 
and the Gaussian wave packet for the free particle (Problem 2.22). This raises an 
interesting question: What is the most general minimum-uncertainty wave packet? 
Looking back at the proof of the uncertainty principle, we note that there were two 
points at which inequalities came into the argument: Equation 3.135 and Equation 
3.136. Suppose we require that each of these be an equality, and see what this tells 
us about 4^. The Schwarz inequality becomes an equality when the angle between 
the two vectors (Equation 3.28) is zero—that is, when one is a multiple of the other: 
\g) = c\f), for some scalar c. (Study the proof of the Schwarz inequality in Problem 
3.5 if you’re not convinced.) Meanwhile, in Equation 3.1361 threw away the real part 
of z; equality results if Re(z) = 0, which is to say, if Re(/|g) =Re(c(/|/)) = 0. 
Now (/|/) is certainly real, so this means the constant c must be purely imaginary— 
let’s call it ia. The necessary and sufficient condition for minimum uncertainty, then, 
is 

\g) = ia\f), where a is real. [3.143] 

In particular, for the position-momentum uncertainty principle this criterion 
becomes 

(jlx~ < p> ) * = - <■*))*■ 


[3.144] 
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which is a differential equation for ^asa function of x, with the general solution 
(see Problem 3.42) 

vl> (x) = Ae-rt-Wf/toetlp)*/*. [3.145] 

Evidently the minimum-uncertainty wave packet is a Gaussian —and sure enough, 
the two examples we encountered earlier were Gaussians. 27 

Problem 3.42 Solve Equation 3.144 for 4/(x). (Note that (x) and ( p) are constants, 
as far as x is concerned.) 


3.4.3 The Energy-Time Uncertainty Principle 

The position-momentum uncertainty principle is usually written in the form 

Ax Ap > *. [3.146] 

Ax (the “uncertainty” in x) is sloppy notation (and sloppy language) for the standard 
deviation in the results of repeated measurements on identically prepared systems. 
Equation 3.146 is often paired with the energy-time uncertainty principle, 

AtAE> h ~. [3.147] 

Indeed, in the context of special relativity the energy-time form might be thought of as 
a consequence of the position-momentum version, because x and t (or rather, ct) go 
together in the position-time four-vector, while p and E (or rather, E/c) go together 
in the energy-momentum four-vector. So in a relativistic theory Equation 3.147 
would be a necessary concomitant to Equation 3.146. But we’re not doing relativistic 
quantum mechanics—the Schrodinger equation is explicitly nonrelativistic: It treats 
t and x on a very unequal footing (as a differential equation it is first -order in t, 
but second -order in x), and Equation 3.147 is emphatically not implied by Equa¬ 
tion 3.146. My purpose now is to derive the energy-time uncertainty principle, and 
in the course of that derivation to persuade you that it is really an altogether different 
beast, whose similarity in appearance to the position-momentum uncertainty principle 
is quite misleading. 

After all, position, momentum, and energy are all dynamical variables—measur¬ 
able characteristics of the system, at any given time. But time itself is not a dynamical 
variable (not, at any rate, in a nonrelativistic theory): You don’t go out and measure 
the “time” of a particle, as you might its position or its energy. Time is the indepen¬ 
dent variable of which the dynamical quantities are functions . In particular, the At 


27 Note that it is only the dependence of 'I' on x that is at issue here—the “constants” A, a, (x), and 
(p) may all be functions of time, and as time goes on vp may evolve away from the minimal form. All I’m 
asserting is that if, at some instant, the wave function is Gaussian in x, then (at that instant) the uncertainty 
product is minimal. 
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in the energy-time uncertainty principle is not the standard deviation of a collection 
of time measurements; roughly speaking (I’ll make this more precise in a moment), 
it is the time it takes the system to change substantially. 

As a measure of how fast the system is changing, let us compute the time 
derivative of the expectation value of some observable, Q(x, p, t): 

d d 94/ „ 90 *94/ 

j t (Q) = + t'pie-gj-). 

Now the Schrodinger equation says 

ih — = HV 
dt 

(where H = p 2 jlm + V is the Hamiltonian). So 




But H is Hermitian, so (tf4/|04/) = (4/| HQ^), and hence 



[3.148] 


This is an interesting and useful result in its own right (see Problems 3.43 and 3.53). 
In the typical case, where the operator does not depend explicitly on /, 28 it tells us 
that the rate of change of the expectation value is determined by the commutator of 
the operator with the Hamiltonian. In particular, if Q commutes with H, then ( Q) is 
constant, and in this sense Q is a conserved quantity. 

Suppose we pick A = H and B = Q, in the generalized uncertainty principle 
(Equation 3.139), and assume that Q does not depend explicitly on t: 


«=(W-(#*?)■-G-)W 

Or, more simply, 


°hoq > - I 


[3.149] 


. * \ d(Q) \ 

- i dt |' 

Let’s define A E = o H (with A as the usual sloppy notation for standard deviation), 


and 


t = -—-. 

\d(Q}/dt\ 


[3.150] 


28 As an example of explicit time dependence, think of the potential energy of a harmonic oscillator 
whose spring constant is changing (perhaps the temperature is rising, so the spring becomes more flexible): 
Q = (1/2 )m[(o(t)\ 2 x 2 . 
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Then 


AE At > 

‘ 2 


[3.151] 


and that’s the energy-time uncertainty principle. But notice what is meant by At here: 
Since 


: 


d(Q) 

dt 


At represents the amount of time it takes the expectation value of Q to change by one 
standard deviation. In particular, At depends entirely on what observable ( Q ) you 
care to look at—the change might be rapid for one observable and slow for another. 
But if AE is small, then the rate of change of all observables must be very gradual, 
and conversely, if any observable changes rapidly, the “uncertainty” in the energy 
must be large. 


Example 1. In the extreme case of a stationary state, for which the energy is 
uniquely determined, all expectation values are constant in time (At = oc)—as, in 
fact, we noticed some time ago (see Equation 2.8). To make something happen , you 
must take a linear combination of at least two stationary states—for example, 

'I'Oc,/) = a\j/\(x)e~ lElt/h + b-f- 2 (x)e~ lElt/h ■ 


If a, b, and \js 2 are real, 


|'I'(x,0| 2 = a 2 (ij/i(x)) 2 + b 2 (i/ 2 (x)) 2 + 2abfa(x)\f/ 2 (x) cos 



■y 


The period of oscillation is r = 2nh/(E 2 — Ef). Roughly, then, AE = E 2 — E\ and 
At = r (for the exact calculation, see Problem 3.44), so 


AE At = 2 7th > 


h 

2 ‘ 


Example 2. How long does it take a free particle wave packet to pass by a 
particular point (Figure 3.1)? Qualitatively (an exact version is explored in Problem 
3.45), At = Axfv = mAx/p, but E = p 1 /2m, so AE = pAp/m. Therefore, 


pApmAx 

AE At - - = Ax Ap> 

m p 


h 

2 ' 


Example 3. The A particle lasts about 10 -23 seconds before spontaneously 
disintegrating. If you make a histogram of all measurements of its mass, you get 
a kind of bell-shaped curve centered at 1232 MeV/c 2 , with a width of about 115 
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h 


Ax 


A x 

Figure 3.1: A free particle wave packet approaches the point A (Example 2). 

MeV/c 2 . Why does the rest energy (me 2 ) sometimes come out higher than 1232, and 
sometimes lower? Is this experimental error? No, for 

A E At = (-^ MeV) (10“ 23 sec) =6x 10“ 22 MeV sec, 

whereas hf 2 = 3 x 10 -22 MeV sec. So the spread in m is about as small as the 
uncertainty principle allows—a particle with so short a lifetime just doesn’t have a 
very well-defined mass. 29 


Notice the variety of specific meanings attaching to the term At in these exam¬ 
ples: In Example 1 it’s a period of oscillation; in Example 2 it’s the time it takes a 
particle to pass a point; in Example 3 it’s the lifetime of an unstable particle. In every 
case, however, At is the time it takes for the system to undergo substantial change. It 
is often said that the uncertainty principle means that energy is not strictly conserved 
in quantum mechanics—that you’re allowed to “borrow” energy A E, as long as you 
“pay it back” in a time At ~ h/2AE\ the greater the violation, the briefer the period 
over which it can occur. There are many legitimate readings of the energy-time un¬ 
certainty principle, but this is not one of them. Nowhere does quantum mechanics 
license violation of energy conservation, and certainly no such authorization entered 
into the derivation of Equation 3.151. But the uncertainty principle is extraordinar¬ 
ily robust: It can be misused without leading to seriously incorrect results, and as a 
consequence physicists are in the habit of applying it rather carelessly. 

♦Problem 3.43 Apply Equation 3.148 to the following special cases: (a) Q = 1; 
(b ) Q — (c) Q = x; (d) Q = p. In each case, comment on the result, with 

particular reference to Equations 1.27, 1.33, 1.38, and 2.35. 

♦♦Problem 3.44 Test the energy-time uncertainty principle for the wave function in 
Problem 2.6 and the observable x by calculating cr H , a x , and d(x) jdt exactly. 


29 Actually, Example 3 is a bit of a fraud. You can’t measure 10 -23 sec on a stop-watch, and in 
practice the lifetime of such a short-lived particle is inferred from the width of the mass plot, using the 
uncertainty principle as input. However, the point is valid even if the numbers are suspect. Moreover, if 
you assume the A is about the same size as a proton (~ 10“ 15 m), then 10“ 23 sec is roughly the time it 
takes light to cross the particle, and it’s hard to imagine that the lifetime could be much less than that. 



116 Chap. 3 Formalism 


**Problem 3.45 Test the energy-time uncertainty principle for the free particle wave 
packet in Problem 2.40 and the observable x by calculating oy/, o x , and d(x)/dt 
exactly. 

Problem 3.46 Show thatthe energy-time uncertainty principle reduces to the “your 
name” uncertainty principle (Problem 3.39) when the observable in question is x. 


FURTHER PROBLEMS FOR CHAPTER 3 _ 

**Problem 3.47 Functions of matrices are defined by their Taylor series expansions; 
for example, 

e M = 1 + M + X -M 2 + ^M 3 + • • •. [3.152] 


(a) Find exp(M), if 

/0 1 3 
(i) M = ( 0 0 4 
\0 0 0 

(b) Show that if M is diagonalizable, then 

det(e M ) = e Tr<M) . [3.153] 

(This is actually true even if M is not diagonalizable, but it’s harder to prove in 
tlie general case.) 

(c) Show that if the matrices M and N commute, then 

e M+N = e M e N [3.154] 


); (ii)M=(_°0 ° 0 y 


Prove (with the simplest counterexample you can think up) that Equation 3.154 
is not true, in general, for noncommuting matrices. 

(d) If H is Hermitian, show that e m is unitary. 

^Problem 3.48 A particle of mass m is in the ground state of the infinite square well 
(Equation 2.15). Suddenly the well expands to twice its original size—the right wall 
moving from a to 2 a —leaving the wave function (momentarily) undisturbed. The 
energy of the particle is now measured. 


(a) What is the most probable result? What is the probability of getting that result? 

(b) What is the next most probable result, and what is its probability? 
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(c) What is the expectation value of the energy? (If you find yourself confronted 
with an infinite series, try another method.) 

Problem 3.49 A harmonic oscillator is in a state such that a measurement of the 
energy would yield either (1/2 )Tko or (3/2 )hco, with equal probability. What is the 
largest possible value of {x) in such a state? If it assumes this maximal value at time 
t = 0, what is ^(x, t)l 

** Problem 3.50 Find the matrix elements (n\x\n') and (n\p\n) in the (orthonormal) 
basis consisting of stationary states for the harmonic oscillator (here |n) refers to the 
state ^«,Eq. 2.50). [You already calculated the diagonal elements (n — ri)m Problem 
2.37; use the same technique for the general case.] Construct the corresponding 
(infinite) matrices, X and P. Show that (l/2m)P 2 + (moo 2 / 2)X 2 = H is diagonal, in 
this basis. Are its diagonal elements what you would expect? Partial answer : 



♦♦Problem 3.51 Show that 




[3.156] 


where $>(p, t ) is the momentum-space wave function. In general, 

f f V*Q(x, l£., t )Vdx, in position space: 


{ Q(x,p,t )) = 


/ <3>*Q 7 P >$ dp, in momentum space. 


Hint: Notice that x exp (ipxjh) = —ih(d/dp) expdpx jfi). 

♦♦Problem 3.52 Find the momentum-space wave function 0„(p, i) for the nth sta¬ 
tionary state of the infinite square well. Construct |<3>„ | 2 (it’s simplest to write separate 
formulas for odd and even n). Show that |4>„ | 2 is finite at p = ±mth/a. 

♦Problem 3.53 Use Equation 3.148 to show that 

d dV 

-( X p)=2(T)-{x — ), [3.158] 

where T is the kinetic energy (H = T -j- V). In a stationary state the left side is zero 
(why?), so 

dV 

2{T) = (x —). [3.159] 

dx 

This is called the virial theorem. Use it to prove that (T) = {V) for stationary states 
of the harmonic oscillator, and check that this is consistent with the results you got 
in Problems 2.14 and 2.37. 
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Problem 3.54 What would it mean for an observable Q to be conserved, in quan¬ 
tum mechanics? At a minimum, the expectation value of Q should be constant in 
time, for any state 4/. The criterion for this (assuming Q has no explicit time de¬ 
pendence) is that Q commute with the Hamiltonian (Equation 3.148). But we’d like 
something more: The probability \c„\ 2 of getting any particular eigenvalue (X n ) of 
Q should be independent of t. Show that this, too, is guaranteed by the condition 
[H, Q] = 0. (Assume that the potential energy is independent oft, but do not assume 
vp is a stationary state.) Hint: Q and H are compatible observables, so they have a 
complete set of simultaneous eigenvalues. 

**Problem 3.55 

(a) For a function f(x) that can be expanded in a Taylor series, show that 

f(x+x 0 )=e‘™ /h f(.x ) 

(where x 0 is any constant distance). For this reason, p/h is called the generator 
of translations in space. (See Problem 3.47 for the meaning of an operator in 
the exponent.) 

(b) If *P(x, t) satisfies the (time-dependent) Schrodinger equation, show that 

vp(x, t + to) = e-t^Vix, t) 

(where t 0 is any constant time); —H/h is called the generator of translations 
in time. 

(c) Show that the expectation value of a dynamical variable Q(x, p, t ), at time 
t + to, can be written 

(<2>r+r„ = W*, t)\e ili '° /h Q(x, p, t + 1*0:, /». 

Use this to recover Equation 3.148. Hint : Let t Q = dt, and expand to first order 
in dt. 


Problem 3.56 In an interesting version of the energy-time uncertainty principle 30 
A t = x/n, where r is the time it takes vp(x, t) to evolve into a state orthogo¬ 
nal to vp(x, 0). Test this out, using a wave function that is an equal admixture 
of two (orthonormal) stationary states of some (arbitrary) potential: ^(x, 0) = 
(\/V2)[f } (x) + 

**Problem 3.57 Dirac proposed to peel apart the bracket notation for an inner prod¬ 
uct, ( ct\/3 ), into two pieces, which he called bra ({a|) and ket (|/3)). The latter is 
a vector, but what exactly is the former? It’s a linear function of vectors, in the 
sense that when it hits a vector (to its right) it yields a (complex) number—the inner 


30 See Lev Vaidman, Am. 1 Phys. 60, 182 (1992) for a proof. 



Further Problems for Chapter 3 119 


product. 31 (When an operator hits a vector, it delivers another vector; when a bra hits 
a vector, it delivers a number.) Actually, the collection of all bras constitutes another 
vector space—the so-called dual space. 

The license to treat bras as separate entities in their own right allows for some 
powerful and pretty notation (though I shall not exploit it further in this book). For 
example, if la) is a normalized vector, the operator 

P = \a){a\ [3.160] 

picks out the component of any other vector that “lies along” |a): 

P\P) = (<*ma); 

we call it the projection operator onto the one-dimensional subspace spanned by 

|a). 

(a) Show that P 2 — P. Determine the eigenvalues of P, and characterize its 
eigenvectors. 

(b) Suppose \ej) is an orthonormal basis for an n-dimensional vector space. Show 
that 

X>jX«/l = 1- [3.161] 

;=i 

This is the tidiest statement of completeness. 

(c) Let Q be an operator with a complete set of orthonormal eigenvectors: 

Q\ej)=Xj\ej) (j = 1,2, 3,...«). 

Show that Q can be written in terms of its spectral decomposition: 

2 = I>.;le/}(e;l- [3.162] 

j = 1 

Hint: An operator is characterized by its action on all possible vectors, so what 
you must show is that 

Q\a) = J2^j\ej){ej\ |a), 

.7=1 

for any vector |a). 


31 In a function space, the bra can be thought of as an instruction to integrate 

(/i = J rv-ux, 

with the “hole” [• • •] waiting to be filled by whatever function the bra encounters next. 
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^Problem 3.58 Imagine a system in which there are just two linearly independent 
states: 

!» = (i) 111111 l 2 > = (?)' 

The most general state is a normalized linear combination: 

\V) =a\\) + b\2) = ^ j , with \a\ 2 + \b\ 2 = 1. 

Suppose the Hamiltonian matrix is 

H -C O’ 

where g and h are real constants. The (time-dependent) Schrodinger equation says 

d 

HI'!') =ih — 1^>. 
dt 

(a) Find the eigenvalues and (normalized) eigenvectors of this Hamiltonian. 

(b) Suppose the system starts out (at t = 0) in state 11). What is the state at time r? 
Answer. 

I*™ = ( cos te'/*) ) 

Note: This is about the simplest nontrivial quantum system conceivable. It is 
a crude model for (among other things) neutrino oscillations. In that case |1) 
represents the electron neutrino, and |2) the muon neutrino; if the Hamiltonian 
has a nonvanishing off-diagonal term g, then in the course of time the electron 
neutrino will turn into a muon neutrino, and back again. At present this is 
highly speculative—there is no experimental evidence for neutrino oscillations; 
however, a very similar phenomenon does occur in the case of neutral A"-mesons 
(£° and K°). 



CHAPTER 4 


QUANTUM MECHANICS IN 
THREE DIMENSIONS 


4.1 SCHRODINGER EQUATION IN SPHERICAL 
COORDINATES 


The generalization to three dimensions is straightforward. Schrodinger’s equation 
says 

ih—=HV; [4.1] 

the Hamiltonian operator 1 H is obtained from the classical energy 


X -mv 2 + V = p 2 x + p 2 y + p 2 z ) + V 


by the standard prescription (applied now to y and z, as well as x): 

n a n a n a 

Px —y—’ Py^ TT-, Pz T ~ > 

i ox i dy i oz 


or 



[4.2] 


[4.3] 


1 Where confusion might otherwise occur, I have been putting “hats” on operators to distinguish 
them from the corresponding classical observables. I don’t think there will be much occasion for ambiguity 
in this chapter, and the hats get to be cumbersome, so I am going to leave them off from now on. 
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for short. Thus 


ifi - = -V 2 vi f + VV, 

dt 2m 


[4.4] 


where 


V 2 = 


a 2 a 2 

dx 2 + df 


dz 2 


[4.5] 


is the Laplacian, in Cartesian coordinates. 

The potential energy V and the wave function 4> are now functions of r = 
(x, y, z) and t. The probability of finding the particle in the infinitesimal volume 
d 3 r = dx dydx is [^(r, t) | 2 d 3 r, and the normalization condition reads 


/ 


\ty\ 2 d 3 r = 1, 


[4.6] 


with the integral taken over all space. If the potential is independent of time, there 
will be a complete set of stationary states. 


4> n (r,r) = if n (r)e~ iEnt/h 


[4.7] 


where the spatial wave function x[r„ satisfies the time-independent Schrodinger equa¬ 
tion: 

h 2 

-vV„ + V^ n = E n f n . [4.8] 

2m 

The general solution to the ( time-dependent ) Schrodinger equation is 

vp(r, t ) = ^2 c n rff n (r)e- ,Ent / h , [4.9] 

with the constants c n determined by the initial wave function, 4>(r, 0), in the usual 
way. (If the potential admits continuum states, then the sum in Equation 4.9 becomes 
an integral.) 


^Problem 4.1 


(a) Work out all of the canonical commutation relations for components of the 
operators r and p: [x, y], [x, p y ], [x, p x ], lp y , p z ], and so on. Answer 

\Ti, Pj] = -[ Pi,rj ] - ihSij, [n , rj\ -- [Pi , Pj] = 0. [4.10] 


(b) Show that 

4(r) = -(p), and 4{p) = (-VY). [4.11] 

dt m dt 

(Each of these, of course, stands for three equations—one for each component.) 
Hint-. Note that Equation 3.148 is valid in three dimensions. 
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(c) Formulate Heisenberg’s uncertainty principle in three dimensions. Answer: 

VxV Pl >fi/ 2, o y o Py > ft/2, a z o pz >fi/2, [4.12] 

but there is no restriction on, say, o x a Py . 


4.1.1 Separation of Variables 


Typically, the potential is a function only of the distance from the origin. In that case 
it is natural to adopt spherical coordinates, (r, 9 , 0) (see Figure 4.1). In spherical 
coordinates the Laplacian takes the form 2 


1 3 / 2 3 \ 1 3 

r 2 dr V Br) + r 2 sin6> BO 




. [4.13] 


In spherical coordinates, then, the time-independent Schrodinger equation reads 


#\L±(2W\ 1 3 / . df\ 1 /3V\1 

2m [ r 2 dr \ dr ) r 2 sin 9 dO \ d9 ) r 2 sin 2 9 \ dtp 2 ) J 

= Ef. [4.14] 


We begin by looking for solutions that are separable into products: 
f( r> 9,<P) = R(r)Y(9,<P). 

Putting this into Equation 4.14, we have 

Ti 2 

2m | r 2 dr V dr } r 2 sin(? 3 9 

+ V RY = ERY. 


[4.15] 


|_ r 2 dr \ dr J r 2 sin# 3 9 \ 3 9 ) 


R d 2 Yl 
sin 2 9 d<p 2 J 



Figure 4.1: Spherical coordinates: radius r, polar angle 6, and azimuthal angle 4>. 

2 In principle, this can be obtained by change of variables from the Cartesian expression (Equation 
4.5). However, there are much more efficient ways of getting it; see, for instance, M. Boas, Mathematical 
Methods in the Physical Sciences, 2nd ed. (New York: John Wiley and Sons, Inc., 1983) Chapter 10, 
Section 9. 
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Dividing by YR and multiplying by —2 mr 2 /h 2 : 


1 d 

~R ~dr 1 




2m r 2 


dY\ , _ . 

eae \ sme Je) + _a 


-[V(r)-E] 


1 d l Y 


The term in the first curly bracket depends only on r, whereas the remainder depends 
only on 0 and <p; accordingly, each must be a constant. For reasons that will appear 
in due course, I will write this “separation constant” in the form /(/ + l ): 3 


1 

Y 


jii ( sin ^) + si 1^1 = ~ l(>+ ]) - 


[4.16] 

[4.17] 


^Problem 4.2 Use separation of variables in Cartesian coordinates to solve the 
infinite cubical well (or “particle in a box”): 

T ., . f 0 , if x, y, z are all between 0 and a ; 

F( *'* Z) = {oo, otherwise. 

(a) Find the stationary state wave functions and the corresponding energies. 

(b) Call the distinct energies E\, E2, £ 3 ,..., in order of increasing energy. Find 
Ei, E2, £ 3 , £4, £5, and £<s. Determine the degeneracy of each of these energies 
(that is, the number of different states that share the same energy). Recall 
(Problem 2.42) that degenerate bound states do not occur in one dimension, but 
they are common in three dimensions. 

(c) What is the degeneracy of £14, and why is this case interesting? 


4.1.2 The Angular Equation 


Equation 4.17 determines the dependence of ^ on 6 and 0; multiplying by Y sin 9, 
it becomes 

o / dY\ 

sin0 w ( sin ^) + a? = ~ /(, + 1,sin2 * 7 ' [4 ' 181 


3 Note that there is no loss of generality here—at this stage / could be any complex number. Later 
on we’ll discover that / must in fact be an integer, and it is in anticipation of that result that I express the 
separation constant in a way that looks peculiar now. 
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You may have encountered this equation already—it occurs in the solution to Laplace’s 
equation in classical electrodynamics. As always, we try separation of variables: 

7(#,0) = 0(#)O(0). [4.19] 

Plugging this in, and dividing by ©O, we find 

{I [™ e Te ( sin ^)] +l(l + J)sin2(, i + W = a 

The first term is a function only of 6 , and the second is a function only of 0, so each 
must be a constant. This time I’ll call the separation constant m 2A 


i r d , 

eK 1 

^sin#^0J + /(/ + 1) sin 2 # = m 2 ; 

[4.20] 


1 d 2 <Z> - 

-— = -m\ 

$ dtp 2 

[4-21] 

The <p equation is easy: 



d 2 <$> 

dtp 2 

= =>• 0(0) — 

[4.22] 


[Actually, there are two solutions: exp (imp) andexp(— imp), but we’ll cover the latter 
by allowing m to run negative. There could also be a constant factor in front, but we 
might as well absorb that into 0. Incidentally, in electrodynamics we would write the 
azimuthal function (0) in terms of sines and cosines, instead of exponentials, because 
electric potentials must be real. In quantum mechanics there is no such constraint, 
and the exponentials are a lot easier to work with.] Now, when p advances by 2n, 
we return to the same point in space (see Figure 4.1), so it is natural to require that 5 

O(0 + 2?r) = 0(0). [4.23] 

In other words, exp [im(p + 2i r)] = exp (imp), or exp(27n'm) = 1. From this it 
follows that m must be an integer: 

m = 0, ±1,±2,.... [4.24] 


4 Again, there is no loss of generality here since at this stage m could be any complex number; 
in a moment, though, we will discover that m must in fact be an integer. Beware: The letter m is now 
doing double duty, as mass and as the so-called magnetic quantum number. There is no graceful way to 
avoid this since both uses are standard. Some authors now switch to M or p. for mass, but I hate to change 
notation in midstream, and I don’t think confusion will arise as long as you are aware of the problem. 

5 This is a more subtle point than it looks. After all, the probability density ( | O | 2 ) is single valued 
regardless of m. In Section 4.3 we’ll obtain the condition on m by an entirely different—and more 
compelling—argument. 
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The 9 equation, 

sin 0 — (sin0—) +[/(/ + 1) sin 2 9 - m 2 ]@ = 0, [4.25] 

d9 \ dO J 

may not be so familiar. The solution is 

0(<9) = AP™ (cos 9), [4.26] 

where P!” is the associated Legendre function, defined by 6 

pp(x) = (1 - * 2 ) w/2 (-)' ' P,{x), [ 4 . 27 ] 

and Pi(x) is the /th Legendre polynomial. We encountered the latter (Equation 3.91) 
as orthogonal polynomials on the interval (-1, +1); for our present purposes it is 
more convenient to define them by the Rodrigues formula: 

[4 ' 281 

For example, 

P 0 U) = 1, p ^ = \^ x2 ~ X ^ =x ' 

Pl(X) = 4^2 ( rfl ) ( * 2 " 1)2 = 2 <3 * 2 “ ^ 

and so on. The first few Legendre polynomials were listed in Table 3.1. As the name 
suggests, Pi (x) is a polynomial (of degree l) in x, and is even or odd according to the 
parity of /. But PJ n (x) is not, in general, a polynomial—if m is odd it carries a factor 
of \/l — x 2 : 

/>“(*) = X -0x 2 - 1 ), pj(x) = (1 --t 2 ) 1/2 ^; ^Ox 2 - l)j = -X 2 , 

P 2 (x) = (1 - j : 2 ) (JPj [ 2 ( 3* 2 - l)j = 3(1 - x 2 ), 

etc. [On the other hand, what we need is P/”(cos0), and V 1 — cos 2 9 = sin#, so 
P™ (cos 9) is always a polynomial in cos 9 , multiplied—if m is odd—by sin 9. Some 
associated Legendre functions of cos 9 are listed in Table 4.L] 


6 Notice that P } m — Pf*. Some authors adopt a different sign convention for negative values of 
m; see Boas (footnote 2) p. 505. 
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Table 4.1: Some associated Legendre functions, P™(cos6). 


Pi = sin# 

Pl = 15 sin0(1 — cos 2 #) 


if = cos# 

Pi = 3 sin 2 # 

Pi = 3 sin # cos # 

P 2 ° = 1(3cos 2 # - 1) 


Pi = 15 sin 2 # cos# 

Pj = | sin#(5 cos 2 # - 1) 
P 3 ° = \ (5cos 3 # -3cos#) 


Notice that / must be a nonnegative integer for the Rodrigues formula to make 
any sense; moreover, if \m\ > /, then Equation 4.27 says Pj n — 0. For any given /, 
then, there are (2/ + 1) possible values of m: 

l = 0, 1, 2,...; m = -l, -/ + 1,..., -1, 0, 1,..., / - 1, /. [4.29] 


But wait! Equation 4.25 is a second-order differential equation: It should have two 
linearly independent solutions, for any old values of / and m. Where are all the other 
solutions? Answer: They exist, of course, as mathematical solutions to the equation, 
but they are physically unacceptable because they blow up at # =0 and/or 6 = n, 
and do not yield normalizable wave functions (see Problem 4.4). 

Now, the volume element in spherical coordinates 7 is 

d 2 r — r 2 sin # dr dO d(f>, [4.30] 

so the normalization condition (Equation 4.6) becomes 

J \if\ 2 r 2 sin# dr d0 d(}) = J \R\ 2 r 2 dr j \Y\ 2 sin # dO d(J) = 1. 

It is convenient to normalize R and Y individually: 



dr — 1 



(7| 2 sin# d9 d(f> = 1. 


[4.31] 


The normalized angular wave functions 8 are called spherical harmonics: 


7 See, for instance, Boas, (footnote 2), Chapter 5, Section 4. 

8 The normalization factor is derived in Problem 4.47. The e factor is chosen for consistency with 
the notation we will be using in the theory of angular momentum; it is reasonably standard, though some 
older books use other conventions. Notice that 

Y - m = (-1 r Y/ m . 
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Table 4.2: The first few spherical harmonics, 7™ (d. 4 


<-(±f 




COS# 

3 \'/2 


7 0_M 

3 “ Vi6tt 


\ 1/2 , 

| (5 cos 3 6 — 3 cos 6) 


/ 3 \ / 21 \ 

— ) sinOe^ Y±' = *(—J sin 6 (5 cos 2 0 

F o ^ ^_|_y /2 ( 3 cos 2 0 _ 1} y ±2 = (^) 1/2 Sin ^co S0 e^ 

7 2 ±[ = T (^) 1/2 sin9 cos ee± '' 0 7 3 ±3 = T ~) 7 sin 3 0e^ 


- I )** 1 


Y™(0, </>) = c. 


(2/ + 1) (/ - |mp! 

4jt (/ + |m |)! 


/f(cos0). 


[4.32] 


where e = (—l) m for m > 0 and e = 1 for m < 0. As we shall prove later on, they 
are automatically orthogonal, so 


f f [Yp{9,(f>)nYf{e,(i>)]smeded(l> = 8 ll ,8 mm ^ [4.33] 

J 0 Jo 

In Table 4.2 I have listed the first few spherical harmonics. 

* Problem 4.3 Use Equations 4.27, 4.28, and 4.32 to construct 7° and Y \. Check 
that they are normalized and orthogonal. 

Problem 4.4 Show that 

©(0) = A ln[tan(0/2)] 

satisfies the 6 equation (Equation 4.25) for / = m — 0. This is the unacceptable 
“second solution”—what’s wrong with it? 


♦Problem 4.5 Using Equation 4.32, find Yj(0, (f>) and 7 3 2 (0, cf>). Checkthat they sat¬ 
isfy the angular equation (Equation 4.18), for the appropriate values of the parameters 
/ and m. 


♦♦Problem 4.6 Starting from the Rodrigues formula, derive the orthonormality con¬ 
dition for Legendre polynomials: 

f' P,(x)P,,(x)dx = *«'■ [4-34] 

Hint: Use integration by parts. 
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4.1.3 The Radial Equation 


Notice that the angular part of the wave function, Y (9, (j>), is the same for all spherically 
symmetric potentials; the actual shape of the potential, V (r), affects only the radial 
part of the wave function, R(r), which is determined by Equation 4.16: 

Tr - ^¥~ lV(r) - E]R = 1(1 + X)R ' 14351 

This equation simplifies if we change variables: Let 


u(r) = rR(r), 


[4.36] 


so that R = u/r, dR/dr = [ r(du/dr ) - w]/r 2 , ( d/dr)[r 2 (dR/dr )] = rd 2 u/dr z , 
and hence 


h 2 d 2 u 

2md? + 


\V + 


2m 


/(/ + 1) 

r 2 



[4.37] 


This is called the radial equation 9 ; it is identical inform to the one-dimensional 
Schrodinger equation (Equation 2.4), except that the effective potential, 


Veff = V + 


Ti 2 /(/ + 1) 
2m r 2 


[4.38] 


contains an extra piece, the so-called centrifugal term, (7z 2 /2m)[/(/-(-l)/r 2 ]. Ittends 
to throw the particle outward (away from the origin), just like the centrifugal (pseudo-) 
force in classical mechanics. Meanwhile, the normalization condition (Equation 4.31) 
becomes 

\u\ 2 dr= 1. [4.39] 

We cannot proceed further until a specific potential is provided. 


Example. Consider the infinite spherical well, 


V(r) = 


0 , 

oo, 


if r < a\ 
if r > a. 


[4.40] 


Outside the well the wave function is zero; inside the well the radial equation says 


A 

dr 2 


'/(/+ 1 ) 
r 2 



[4.41] 


9 Those m 's are masses, of course—the radial equation makes no reference to the quantum 
number m. 
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where 



ti 


[4.42] 


as usual. Our problem is to solve this equation, subject to the boundary condition 
u{a) = 0. The case / = 0 is easy: 


d 2 u 

dr 2 


= —k 2 u 


u(r) — A sin (kr) + B cos (kr). 


But remember, the actual radial wave function is R(r) = u(r)/r, and [cos(fo*)]/r 
blows up as r -* 0. So 10 we must choose B — 0. The boundary condition then 
requires sin(&a) = 0, and hence ka = ruz, for some integer n. The allowed energies 
are evidently 

n 2 7T 2 h 2 

E n o=^-^, (n = 1,2,3,...), [4.43] 

2 ma 2 

the same as for the one-dimensional infinite square well (Equation 2.23). Normalizing 
u(r) yields A = *j2/a\ inclusion of the angular part (constant, in this instance, since 
y 0 V,tf>) = i/v^F), we conclude that 


1 sin (mzr/a) 

Wn00 = - - 

V27T£* r 


[4.44] 


[Notice that the stationary states are labeled by three quantum numbers, n, l, and 
m: \j/ n i m {r , 9, <f>). The energy , however, depends only on n and /: 

The general solution to Equation 4.41 (for an arbitrary integer /) is not so 
familiar: 

u(r) = Arji(kr) + Brni(kr), [4.45] 

where ji(x) is the spherical Bessel function of order /, and ni(x) is the spherical 
Neumann function of order /. They are defined as follows: 


ji(x) = (-x) 1 



sin * 
x 


, / 1 d V cosx 

ni {x)^-{-x) 1 -. [4.46] 

\x dx j x 


For example, 


MX) - ; 


n 0 (x) - 


cosx 

X 


j](x) = (-x) 


id /sinx\ 
x dx \ x ) 


cos* 


10 Actually, all we require is that the wave function be normalizable, not that it be finite : R{r) ~ 1 jr 
at the origin would be normalizable (because of the r 2 in Equation 4.31). For a more compelling proof 
that 5 = 0, see R. Shankar, Principles of Quantum Mechanics (New York: Plenum, 1980), p. 351. 
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»iW = -(-*) 


1 d /cosjc 
jc dx V x 


sin x 
x 


and so on. The first few spherical Bessel and Neumann functions are listed in 
Table 4.3. Notice that for small jc (where sinjc ^ jc — jc 3 /3! + jj 5 /5! — • • • and 
cos x % 1 - x 2 /2 + jc 4 /4! — •••), 


1 jc 1 

yoOO^l; «oW ^ Mx) ^n iW«—r; 

jc 3 x L 

etc. The point is that the Bessel functions are finite at the origin, but the Neumann 
functions blow up at the origin. Accordingly, we must have = 0, and hence 


R(r) = Ajt(kr). [4.47] 

There remains the boundary condition, R (a) = 0. Evidently k must be chosen 
such that 

J)(ka ) = 0; [4.48] 

that is, ( ka ) is a zero of the / th -order spherical Bessel function. Now the Bessel 
functions are oscillatory (see Figure 4.2); each one has an infinite number of zeros. 
But (unfortunately, for us) they are not located at nice sensible points (such as n, or 
nn , or something); they have to be computed numerically. 11 At any rate, the boundary 
condition requires that 

k = -ft,, [4.49] 

a 


Table 4.3: The first few spherical Bessel and Neumann functions, ji(x) and mix). 


jo = - 


sin x cos x 


cosjc sinjc 


x l (2/ — 1)!! 


cosjc-t sinjc 


11 Abramowitz and Stegun, eds., Handbook of Mathematical Functions (New York: Dover, 1965), 
Chapter 10, provides an extensive listing. 
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Figure 4.2: Graphs of the first four spherical Bessel functions. 

where /?„/ is the n 111 zero of the / th spherical Bessel function. The allowed energies, 
then, are given by 

= [4 ' 501 

and the wave functions are 

fnimiC 6 , 0) = A Hl ji(p n ir/a)Yr(6, 0), [4.51] 

with the constant A n / to be determined by normalization. Each energy level is (2/+1)- 
fold degenerate, since there are (21 + 1 ) different values of m for each value of l (see 
Equation 4.29). 

Problem 4.7 

(a) From the definitions (Equation 4.46), construct 72 (jc) and n, 2 (x). 

(b) Expand the sines and cosines to obtain approximate formulas for 72 (x) and 
H 2 OO, valid when r « 1. Confirm that 72 (jc) is finite at the origin but M 2 OO 
blows up. 


Problem 4.8 

(a) Check that A rj\ ( kr) satisfies the radial equation (Equation 4.37) with V (r) = 0 
and / = 1 . 
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(b) Determine graphically the allowed energies for the infinite spherical well when 
1 = 1. Show that for large n, E ni ^ (h 2 n 2 f2mo 2 )(n + 1/2) 2 . 


**Problem 4.9 A particle of mass m is placed in a finite spherical well: 


V(r) = 


I 


0 , 

Vo, 


if r < a; 
if r > a. 


Find the ground state by solving the radial equation with / = 0. Show that there is 
no bound state at all if Vo a 2 < n 2 h 2 /%m. 


4.2 THE HYDROGEN ATOM _ 

The hydrogen atom consists of a heavy, essentially motionless proton (we may as 
well put it at the origin) of charge e, together with a much lighter electron (charge 
—e) that circles around it, held in orbit by the mutual attraction of opposite charges 
(see Figure 4.3). From Coulomb’s law, the potential energy (in SI units) is 


V(r) = - 


e 2 1 
4ttco r 


[4.52] 


and the radial equation (Equation 4.37) says 


i 2 d 2 u f e 2 1 h 2 1(1 +1)1 

-u-1--— u = Eu. 

dr 2 L 4jreo r 2m r 2 


Our problem is to solve this equation for u(r) and determine the allowed electron 
energies E. The hydrogen atom is such an important case that I’m not going to hand 
you the solutions this time—we’ll work them out in detail by the method we used 
in the analytical solution to the harmonic oscillator. (If any step in this process is 
unclear, you may wish to refer back to Section 2.3.2 for a more complete explana¬ 
tion.) Incidentally, the Coulomb potential (Equation 4.52) admits continuum states 
(with E > 0), describing electron-proton scattering, as well as discrete bound states, 
representing the hydrogen atom, but we shall confine our attention to the latter. 



(electron) 


Figure 4.3: The hydrogen atom. 



134 


Chap . 4 Quantum Mechanics in Three Dimensions 


4.2.1 The Radial Wave Function 


Our first task is to tidy up the notation. Let 

sJ—lmE 


[4.54] 


(For bound states, E < 0, so k is real.) Dividing Equation 4.53 by E, we have 
1 d 2 u _ I" me 2 1 /(/ +1)1 

tc 2 dr 2 L 27T€oh 2 K ( ter) (icr) 2 J U ' 


This suggests that we let 


so that 


p = ter. 

and pq 

d 2 u r 

i- —+ 

P 

dp 2 ~[ 


27iedh z K 


[4.55] 


)1 

- u. 


[4.56] 

Next we examine the asymptotic form of the solutions. As /? -* oo, the constant 
term in the brackets dominates, so (approximately) 

d 2 u 

dp 2 " U ‘ 

The general solution is 

u(p) = Ae~ p + Be?, 

but e p blows up (as p oo), so B = 0. Evidently, 


[4.57] 


u(p) ~ Ae p [4.58] 

for large p. On the other hand, as p -* 0 the centrifugal term dominates 12 ; approxi¬ 
mately, then, 

d 2 u _ /(/+ 1) 

W~—p r ~ u ' 

The general solution (check it!) is 

u(p) = Cp l+l + Dp~ l , 
but p~ l blows up (as p -> 0), so D = 0. Thus 


u(p) - Cp l + l 


[4.59] 


12 This argument does not apply when / = 0 (although the conclusion, Equation 4.59, is in fact 
valid for that case too). But never mind: All I am trying to do is provide some motivation for a change of 
variables (Equation 4.60.) 
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for small p. 

The next step is to peel off the asymptotic behavior, introducing the new function 

v(p): 

u(p) = p +l e P v(p), [4.60] 

in the hope that v(p) will turn out to be simpler than u(p). The first indications are 
not auspicious: 

g = A-[(/ + i-of + pg]. 


d 2 u IT /(/+1)1 dv d 2 v] 

^ = ^-, j [_ 2/ _ 2 + p + ___j u + 2(/ + l-p)- + p^). 

In terms of v(p), then, the radial equation (Equation 4.56) reads 

p^+2(l + \-p)^- + [/>o - 2(1 + 1)]» = 0. [4.61] 

dp 2 dp 

Finally, we assume the solution, v(p), can be expressed as a power series in p: 


v{p) = ^ a jP } - t 4 - 62 J 

7=0 

Our problem is to determine the coefficients ( a o, a \, ai, ...). Differentiating term by 
term, 

T 00 00 

-J- = a i pi ~' = S(7+ D a l+iP J - 

d P U y=o 

[In the second summation I have renamed the “dummy index”: j j + 1. If this 
troubles you, write out the first few terms explicitly, and check it. You might say that 
the sum should now begin at j = —1, but the factor ( / + 1) kills that term anyway, 
so we might as well start at zero.] Differentiating again, 

d ^=iL j{j+i)a j +ipj ~ x ' 


Inserting these into Equation 4.61, we have 


j(j + \)a j+[ p j + 2(1 + 1) + l)a j+ ip J 

7=0 7=0 


-2^ jdjp J + [po - 2(/ + 1 )]^ a jP j = 0* 
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Equating the coefficients of like powers yields 

7(7 + ^) a j +1 + 2(/ + l)(y + l)o/+i — 2 jaj + [p>o — 2 (/ + 1 )]aj = 0, 


or 


a j +1 


2 O' + / + 1) - Po | 

0' + l)0' + 2/ + 2)P 


[4.63| 


This recursion formula determines the coefficients, and hence the function v(p): 
We start with a 0 = A (this becomes an overall constant, to be fixed eventually b> 
normalization), and Equation 4.63 gives us a \; putting this back in, we obtain a 2 , and 
so on. 13 


Now let’s see what the coefficients look like for large j (this corresponds to 
large p, where the higher powers dominate). In this regime the recursion formula 
says 


aj +1 


2 7 

7(7 + 1) 



so 



[4.64] 


Suppose for a moment that this were the exact result. Then 


00 ' 27 

v(p) = A Y —p j = Ae lp , 

7=o J 

and hence 

u{p) = Ap l+l e p , [4.65] 

which blows up at large p. The positive exponential is precisely the asymptotic behav¬ 
ior we didn’t want in Equation 4.57. (It’s no accident that it reappears here; after all. 
it does represent the asymptotic form of some solutions to the radial equation—they 
just don’t happen to be the ones we’re interested in, because they aren’t normalizable.) 
There is only one way out of this dilemma: The series must terminate. There must 
occur some maximal integer, y max , such that 


1 = 0 [4.66] 

(and beyond which all coefficients vanish automatically). Evidently (Equation 4.63) 

2(y’max + / + 1) — Po = 0. 


13 You might wonder why I didn’t use the series method directly on «(p)—why factor out the 
asymptotic behavior before applying this procedure? The reason for peeling off p /+1 is largely aesthetic: 
Without this, the sequence would begin with a long string of zeroes (the first nonzero coefficient being 
o/ + i); by factoring out p l+l we obtain a series that starts out with p°. The e~ p factor is more critical—if 
you don’t pull that out, you get a three-term recursion formula involving a- ;+ 2 , and o/ (try it!), and 
that is enormously more difficult to work with. 
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n — ./max + ^ + 1 

(the so-called principal quantum number), we have 


But pq determines E (Equations 4.54 and 4.55): 


so the allowed energies are 


. [4 (_fiyi _!.= *■, n = 1,2, 3 , 

2h 2 \47teoJ J n 2 n 2 


This is the famous Bohr formula—by any measure the most important result in 
all of quantum mechanics. Bohr obtained it in 1913 by a serendipitous mixture 
of inapplicable classical physics and premature quantum theory (the Schrodinger 
equation did not come until 1924). 

Combining Equations 4.55 and 4.68, we find that 


4tt cqh ) n an 


■ = 0.529 x 10” 1U m 


is the so-called Bohr radius. It follows (again, from Equation 4.55) that 


Evidently the spatial wave functions for hydrogen are labeled by three quantum num¬ 
bers ( n , /, and m): 

% lm (r,e,4>) = Rn l (r)Y?(e,4>), [4.74] 

where (referring back to Equations 4.36 and 4.60) 


Rn,(r) = -p‘ + 'e-»v(p). 


[ 4 . 75 ] 
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and v(p) is a polynomial of degree j max = n — / — 1 in p, whose coefficients are 
determined (up to an overall normalization factor) by the recursion formula 


_ 2 Q’ + / + 1 - n) 

° i+] ~ 0'+l)0' + 2/ + 2)“ ; 


[4.76] 


The ground state (that is, the state of lowest energy) is the case n = 1; putting 
in the accepted values for the physical constants, we get 


Ei =- 



-13.6 eV. 


[4.77] 


Evidently the binding energy of hydrogen (the amount of energy you would have to 
impart to the electron in order to ionize the atom) is 13.6 eV. Equation 4.67 forces 
/ = 0, whence also m — 0 (see Equation 4.29), so 

tioair, 6, 0) = R,o(r)YS(0,0 ). [4.78] 


The recursion formula truncates after the first term (Equation 4.76 with j = 0 yields 
ai = 0), so v(p) is a constant («o) and 


«,<,(/-) = -e~ rl “. [4.79] 

a 

Normalizing it, in accordance with Equation 4.31, 

r IRioWdr = re-W'Sdr = |a 0 | 2 j = 1, 

Jo a 2 J 0 4 

so cio = 2 /yfa. Meanwhile, Y® — 1 /\/4tt, so 


V'looO'. 4 >) — ,— T e r/a - 
\fna s 


lin = 2 the energy is 


£2 


-13.6 eV 
4 


-3.4 eV; 


[4.80] 


[4.81] 


this is the first excited state—or rather, states, since we can have either / = 0 (in 
which case m = 0) or / = 1 (with m = -1, 0, or +1), so there are actually four 
different states that share this energy. If / = 0, the recursion relation (Equation 4.76) 
gives 

a 1 — —ao (using j — 0), and aj = 0 (using j = 1), 
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so v(p) = flo(l — P)> and hence 


^20 ( r ) 


2 a ' 


2a) 


[4.82] 


If / = 1 the recursion formula terminates the series after a single term, so v(p) is a 
constant, and we find 

*2i(r) = 0re-^. [4.83] 

(In each case the constant a 0 is to be determined by normalization—see Problem 
4.11.) 

For arbitrary n, the possible values of l (consistent with Equation 4.67) are 


/ = 0, 1,2, 1. 


[4.84] 


For each /, there are (2/ + 1) possible values of m (Equation 4.29), so the total 
degeneracy of the energy level E n is 

n -1 

d{n) = ^(2/ + 1) = n 2 . [4.85] 

/—o 


The polynomial v(p) (defined by the recursion formula, Equation 4.76) is a function 
well known to applied mathematicians; apart from normalization, it can be written as 


V(p) = L*+'_ x (2p), 

[4.86] 

where 


/ d \ p 

Lq- P (x ) = (-1)^ f — J L q {x) 

[4.87] 

is an associated Laguerre polynomial, and 


L q (x) = e?(J^j (e^x^) 

[4.88] 


is the ^rth Laguerre polynomial. 14 (The first few Laguerre polynomials are listed in 
Table 4.4; some associated Laguerre polynomials are given in Table 4.5. The first 
few radial wave functions are listed in Table 4.6 and plotted in Figure 4.4.) The 
normalized hydrogen wave functions are 15 


[4.89] 


14 As usual, there are rival normalization conventions in the literature; I have adopted the most nearly 
standard one. 

15 If you want to see how the normalization factor is calculated, study (for example), L. Schiff, 
Quantum Mechanics, 2nd ed. (New York: McGraw-Hill, 1968), page 93. 
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Table 4.4: The first few Laguerre polynomials, L q (x). 

Lq = 1 

Ll = -x + l 

L 2 = x 2 - 4x + 2 

£3 = —x 3 + 9x 2 — 18x + 6 

L 4 = x 4 - 16x 3 + 72x 2 - 96x + 24 

L 5 = -x 5 + 25x 4 - 200x 3 + 600x 2 - 600x + 120 

I 6 = x 6 - 36x 5 + 450x 4 - 2400x 3 + 5400x 2 - 4320x + 720 


Table 4.5: Some associated Laguerre polynomials, L p q _ p (x). 


^0 = 1 

L\ = 2 

L\ = -X + 1 

l\ = — 6x + 18 

Z,0 = x 2 - 4x + 2 

L\ = 12x 2 - 96x + 144 

L l = 1 

L 3 = 6 

L\ = — 2x + 4 

L\ - — 24x + 96 

L\ = 3x 2 - 18x + 18 

L\ = 60x 2 - 600x + 1200 


They are not pretty, but don’t complain—this is one of the very few realistic systems 
that can be solved at all, in exact closed form. As we will prove later on, they are 
mutually orthogonal: 

J is nim tyn'Vm' r 2 sin9 dr d0 d(p = [4.90] 


^Problem 4.10 Work out the radial wave functions R 3Q , R 3l , and R 32 , using the 
recursion formula (Equation 4.76). Don’t bother to normalize them. 

^Problem 4.11 

(a) Normalize R 2 o (Equation 4.82), and construct the function \f/ 2 oo- 

(b) Normalize R 2 \ (Equation 4.83), and construct t/f 2 n, i/' 210 , and 1 ^ 21 - 1 - 

** Problem 4.12 


(a) Using Equation 4.88, work out the first four Laguerre polynomials. 

(b) Using Equations 4.86, 4.87, and 4.88, find v(p) for the case n =5,1 = 2. 
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Table 4.6: The first few radial wave functions for hydrogen, R„i(r). 



Figure 4.4: Graphs of the first few hydrogen radial wave functions, R„i(r). 
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(c) Again, find v(p) for the case n = 5,1 = 2, but this time get it from the recursion 
formula (Equation 4.76). 

^Problem 4.13 

(a) Find {r} and (r 2 ) for an electron in the ground state of hydrogen. Express your 
answers in terms of the Bohr radius a. 

(b) Find (x) and (x 2 ) for an electron in the ground state of hydrogen. Hint: This 
requires no new integration—note that r 2 = x 2 + y' 2 + z 2 , and exploit the 
symmetry of the ground state. 

(c) Find (x 2 ) in the state n = 2, l = 1, m = 1. Hint : This state is not symmetrical 
in x, y, z. Use x = r sin# cos</>. 

Problem 4.14 What is the probability that an electron in the ground state of hy¬ 
drogen will be found inside the nucleusl 

(a) First calculate the exact answer, assuming that the wave function (Equation 
4.80) is correct all the way down to r = 0. Let b be the radius of the nucleus. 

(b) Expand your result as a power series in the small number e = 2 bja, and show 

that the lowest-order term is the cubic: P (4/3)(6/a) 3 . This should be a 

suitable approximation, provided that b a (which it is). 

(c) Alternatively, we might assume that yfr{r) is essentially constant over the (tiny) 
volume of the nucleus, so that P % (4/3)tt6 3 |^(0)| 2 . Check that you get the 
same answer this way. 

(d) Use b & 10 _15 m and a & 0.5 x 10 _10 m to get a numerical estimate for 
P. Roughly speaking, this represents the “fraction of its time that the electron 
spends inside the nucleus”. 

Problem 4.15 

(a) Use the recursion formula (Equation 4.76) to confirm that when l — n — 1 the 
radial wave function takes the form 

R n{n - 1) = N n r n - [ e- r/na , 

and determine the normalization constant N„ by direct integration. 

(b) Calculate { r) and (r 2 ) for states of the form ^«(n-i)m- 

(c) Show that o r — (r) j«j2n + 1 for such states. Note that the fractional spread in 
r decreases with increasing n (in this sense the system “begins to look classical” 
for large n). Sketch the radial wave functions for several values of n to illustrate 
this point. 



Sec. 4.2: The Hydrogen Atom 143 


4.2.2 The Spectrum of Hydrogen 


In principle, if you put a hydrogen atom into some stationary state ^„i m , it should 
stay there forever. However, if you tickle it slightly (by collision with another atom, 
say, or by shining light on it), then the atom may undergo a transition to some other 
stationary state—either by absorbing energy and moving up to a higher-energy state, 
or by giving off energy (typically in the form of electromagnetic radiation) and moving 
down. 16 In practice such perturbations are always present; transitions (or, as they are 
sometimes called, “quantum jumps”) are constantly occurring, and the result is that 
a container of hydrogen gives off light (photons), whose energy corresponds to the 
difference in energy between the initial and final states: 


Ey = Ei - Ef — 



[4.91] 


Now, according to the Planck formula, 17 the energy of a photon is proportional 
to its frequency: 

Ey=hv. [4.92] 

Meanwhile, the wavelength is given by A, — cjv, so 


1 

i = * 



[4.93] 


where 

m ( e 2 \ 2 7 i 

R = -r - = 1.097 x 10 7 m _1 . [4.94] 

Aitch \47T6o / 

R is known as the Rydberg constant, and Equation 4.93 is the Rydberg formula for 
the spectrum of hydrogen. It was discovered empirically in the nineteenth century, 
and the greatest triumph of Bohr’s theory was its ability to account for this result— 
and to calculate R in terms of the fundamental constants of nature. Transitions to the 
ground state («/ = 1) lie in the ultraviolet; they are known to spectroscopists as the 
Lyman series. Transitions to the first excited state (rif = 2) fall in the visible region; 
they constitute the Balmer series. Transitions to n f — 3 (the Paschen series) are in 
the infrared, and so on (see Figure 4.5). (At room temperature, most hydrogen atoms 
are in the ground state; to obtain the emission spectrum, you must first pump them 
up into the various excited states; typically this is done by passing an electric spark 
through the gas.) 


16 By its nature, this involves a time-dependent interaction, and the details will have to wait for 
Chapter 9; for our present purposes the actual mechanism involved is immaterial. 

17 The photon is a quantum of electromagnetic radiation; it’s a relativistic object if there ever was 
one, and therefore outside the scope of nonrelativistic quantum mechanics. It will be useful in a few places 
to speak of photons and to invoke the Planck formula for their energy, but please bear in mind that this is 
external to the theory we are developing. 
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Problem 4.16 Consider the earth-sun system as a gravitational analog to the hy¬ 
drogen atom. 

(a) What is the potential energy function (replacing Equation 4.52)? (Let m be the 
mass of the earth and M the mass of the sun.) 

(b) What is the “Bohr radius” for this system? Work out the actual numerical value. 

(c) Write down the gravitational “Bohr formula”, and, by equating E n to the clas¬ 
sical energy of a planet in a circular orbit of radius r 0 , show that n = Qr^ja. 
From this, estimate the quantum number n of the earth. 

(d) Suppose the earth made a transition to the next lower level (n - 1). How much 
energy (in Joules) would be released? What would the wavelength of the emitted 
photon (or, more likely, graviton) be? 

❖Problem 4.1 7 A hydrogenic atom consists of a single electron orbiting a nucleus 
with Z protons. (Z = 1 would be hydrogen itself, Z = 2 is ionized helium, Z = 3 is 
doubly ionized lithium, and so on.) Determine the Bohr energies E n (Z), the binding 
energy E\ (Z), the Bohr radius a (Z), and the Rydberg constant R(Z) for a hydrogenic 
atom. (Express your answers as appropriate multiples of the hydrogen values.) Where 
in the electromagnetic spectrum would the Lyman series fall, for Z = 2 and Z = 3? 
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4.3 ANGULAR MOMENTUM _ 

In classical mechanics, the angular momentum of a particle (with respect to the origin) 
is given by the formula 

L = r x p, [4.95] 

which is to say, 

L x — yp 2 — zp y , L y = zp x -xp z , and L z —xp y -yp x . [4.96] 

The corresponding quantum operators are obtained by the standard prescription 
(Equation 4.2): 



In the following sections we will deduce the eigenvalues and eigenfunctions of these 
operators. 


4.3.1 Eigenvalues 


L x and L y do not commute; in fact [providing a test function, /(x, y, z), for them to 
act upon]: 


lL x ,L y ]f 



a 2 / 2 a 2 / a 2 / a/ a 2 /\ 

~ zy TT~ + z +xy TT ~ x ^~ ~ xz t^~ )• 

dxdz dxdy dz 2 dy dzdy J 



146 Chap. 4 Quantum Mechanics in Three Dimensions 


All the terms cancel in pairs (by virtue of the equality of cross-derivatives) except 
two: 

and we conclude (dropping the test function) 


[L x ,L y ] = ihL z . 


[4.98] 


By cyclic permutation of the indices it follows also that 

[L y ,L z \ = itiL x and [L z , L x ] = ifiL y . [4.99] 

From these fundamental commutation relations the entire theory of angular momen¬ 
tum can be deduced. 

Evidently L x , L y , and L z are incompatible observables. According to the 
generalized uncertainty principle (Equation 3.139), 

° f * 

o l o Lv >-\(L z )\. [4.100] 

It would therefore be futile to look for states that are simultaneously eigenfunctions 
of L x and of L y . On the other hand, the square of the total angular momentum, 

L 2 = L 2 X + L 2 y + L 2 z , [4.101] 

does commute with L x : 

[L 2 ,L X ] = [L 2 x ,L x ] + [L 2 y ,L x ] + [L 2 ,L x ] 

= L y [L y , L x ] + [Ly, L x ]L y + L Z [L z , L x ] + [L z , L X ]L Z 
= L y (-ifiL z ) + (-ifiL z )L y + L z (itiL y ) + ( ifiL y )L z 
- 0 . 

(I used Equation 3.142 and the fact that any operator commutes with itself. 18 ) It 
follows, of course, that L 2 also commutes with L y and L z : 

[L 2 ,L x ] = 0, [L 2 ,L y ]= 0, [L 2 ,L z ] = 0, [4.102] 


18 Note that all the operators we encounter in quantum mechanics (see footnote 8, Chapter 1) are 
linear, in the sense that A(f + g) = A f + Ag, and therefore distributive with respect to addition: 
A(B + C) = AB + AC. In particular, [A, B + C] — [A, B ] + [A, C], 
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or, more compactly, 

[L 2 ,L]^0. [4.103] 

So L 2 is compatible with each component of L, and we can hope to find simultaneous 
eigenstates of L 2 and (say) L z . 

L 2 f = A/ and L z f =/if [4.104] 

We’ll use a “ladder operator” technique, very similar to the one we applied to 
the harmonic oscillator back in Section 2.3.1. Let 

L ± = L x ±iL y . [4.105] 


Its commutator with L z is 

[L z , L±] = [L z , L x ] ± i[L z , L y ] = ihL y ± i(—ihL x ) = ±ft(L x ± iL y ), 


so 


[L z , L ± ] = ±hL ± . 


[4.106] 


And, of course, 

[L 2 ,L±\= 0. [4.107] 

I claim that if / is an eigenfunction of L 2 and L z , so also is L±f. For Equation 4.107 
says 

L 2 (L ± f) = L ± (L 2 f) = L ± (Xf) = X(L ± f), [4.108] 

so L±f is an eigenfunction of L 2 , with the same eigenvalue A, and Equation 4.106 
says 


L z (L ± f) = (L Z L± - L±L z )f+L±L z f = ±TiL ± f+ L ± (jif) 

[4.109] 

= (n±h)(L±f), 

so L ± f is an eigenfunction of L z with the new eigenvalue a ± h. L + is called the 
“raising” operator because it increases the eigenvalue of L z by h, and L_ is called 
the “lowering” operator because it lowers the eigenvalue by h. 

For a given value of A, then, we obtain a “ladder” of states, with each “rung” sep¬ 
arated from its neighbors by one unit of Ti in the eigenvalue of L z (see 
Figure 4.6). To ascend the ladder we apply the raising operator, and to descend, 
the lowering operator. But this process cannot go on forever: Eventually we’re going 
to reach a state for which the z-component exceeds the total, and that cannot be (see 
Problem 4.18). So there must exist a “top rung,” f t , such that 19 


L+f = 0. 


[4.110] 


19 Actually, all we can conclude is that L + f t is not normalizable —its norm could be infinite, instead 
of zero. Problem 4.19 eliminates this alternative. 
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Figure 4.6: The “ladder” of angular 
momentum states. 


Let Til be the eigenvalue of L z at this top rung (the appropriateness of the letter 
/—sometimes called the azimuthal quantum number— will appear in a moment): 

Lj,=nif,-, L 2 f, = xf„ [4.H1] 


Now 

L ± L r = (L x ± iL,)(L, T iLy) = L\ + L\ T i(L x L y - L,L X ) 

= L 2 -L\^HiTiL z ), 

or, putting it the other way around, 

L 2 = L ± L^ + L 2 z ^hL z . [4.112] 

It follows that 

L 2 f t = (L.L+ + L\ + T,L z )f, = (0 + a 2 / 2 + h 2 l)f, = h 2 l(l + 1 )/„ 
and hence 

\=h 2 l(l + 1). [4.113] 

This tells us the eigenvalue of L 2 in terms of the maximum eigenvalue of L z . 

Meanwhile, there is also (for the same reason) a bottom rung, f b , such that 

L-fb = 0. [4.114] 

Let hi be the eigenvalue of L z at this bottom rung: 

LJ b =Mf b \ L 2 f b = Xf b . [4.115] 

Using Equation 4.112, we have 

L 2 f b = (L+L- + l\ - hL z )f b = (0 + » 2 P - h 2 I)f b = n 2 I(i- l )f b . 
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and therefore 

X=n 2 I(I-l). [4.116] 

Comparing Equations 4.113 and 4.116, we see that /(/ + 1) = /(/ — 1), so either 
/ = / + 1 (which is absurd—the bottom rung is higher than the top rung!), or else 

I=-l. [4.117] 

Evidently the eigenvalues of L z are mfi, where m (the appropriateness of this 

letter will also be clear in a moment) goes from —l to +/ in N integer steps. In 
particular, it follows that l = —l + N, and hence / = NJ2, so / must be an integer or 
a half-integer. The eigenfunctions are characterized by the numbers / and m: 

L 2 fJ" =nH(l+l)JT; L z JT=hmfT, [4.118] 

where 

/ = 0, 1/2, 1,3/2,...; m — —l, —/ -j- 1 ,..., / — 1, /. [4.119] 

For a given value of /, there are 21 + 1 different values of m (i.e., 21 4- 1 “rungs” on 
the “ladder”). 

I hope you’re impressed: By purely algebraic means, starting with the fun¬ 
damental commutation relations (Equations 4.98 and 4.99), we have determined the 
eigenvalues of L 2 and L z —without ever seeing the eigenfunctions themselves! We 
turn now to the problem of constructing the eigenfunctions, but I should warn you 
that this is a much messier business. Just so you know where we’re headed. I’ll tell 
you the punch line before we begin: ff 1 = Yf —the eigenfunctions of L 2 and L z are 
nothing but the old spherical harmonics, which we came upon by a quite different 
route in Section 4.1.2 (that’s why I chose the letters l and m, of course). 

Problem 4.18 

(a) Prove that if / is simultaneously an eigenfunction of L 2 and of L z (Equa¬ 
tion 4.104), the square of the eigenvalue of L z cannot exceed the eigenvalue of 
L 2 . Hint : Examine the expectation value of L 2 . 

(b) As it turns out (see Equations 4.118 and 4.119), the square of the eigenvalue of L z 
never even equals the eigenvalue of L 2 (except in the special case l = m = 0). 
Comment on the implications of this result. Show that it is enforced by the 
uncertainty principle (Equation 4.100), and explain how the special case gets 
away with it. 


❖Problem 4.19 The raising and lowering operators change the value of m by one 
unit: 

L±/r = (A'!‘)fr ± ', [4.120] 

where A™ is some constant. Question : What is A™, if the eigenfunctions are to be 
normalized ? Hint: First show that is the Hermitian conjugate of L± (since L x 
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and L y are observables, you may assume they are Hermitian, but prove it if you like i. 
then use Equation 4.112. Answer : 

Af = tiy/1(l +1) — m(m i 1). [4.121; 

Note what happens at the top and bottom of the ladder. 

♦Problem 4.20 

(a) Starting with the canonical commutation relations for position and momentum. 
Equation 4.10, work out the following commutators: 

[L z ,x] = ihy, [L z , y\ =-ihx, [L z ,z] = 0 ^ 

[L z , p x ] = ihp y , \L Z , p y \ = —iftpx, [L z ,p z ] = 0. 

(b) Use these results to obtain \L Z , L x ] = ThL y directly from Equation 4.96. 

(c) Evaluate the commutators [L z , r 2 ] and [L z , p 2 ] (where, of course, r 2 = 
x 2 + / + z 2 and p 2 = p 2 + p 2 y 4- p 2 z ). 

(d) Show that the Hamiltonian H — (p 2 /2m) + V commutes with all three com¬ 
ponents of L, provided that V depends only on r. (Thus II, L 2 , and L z are 
mutually compatible observables.) 

♦♦Problem 4.21 

(a) Prove that for a particle in a potential V (r) the rate of change of the expectation 
value of the orbital angular momentum L is equal to the expectation value of 
the torque: 

^L) = < N ), 

where 

N = r x (-VV). 

(This is the rotational analog to Ehrenfest’s theorem.) 

(b) Show that d{L)/dt = 0 for any spherically symmetric potential. (This is one 
form of the quantum statement of conservation of angular momentum.) 


4.3.2 Eigenfunctions 


First of all we need to rewrite L x , L y , and L z in spherical coordinates. Now L = 
(h/i)(r x V), and the gradient, in spherical coordinates, is 20 


_ „ 9 d a ~ l 9 

V = r -h 0 -(- 0-; 

dr rdd rsin0 90 


[4.123] 


20 George Arfken, Mathematical Methods for Physicists, 3rd ed. (Orlando, FL: Academic Press. 
1985), Section 2.5. 
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meanwhile, r = rr, so 


Ti T , , 3 „ ~ 3 1 3 

L=t r(r xr)—+ (r x0)—+ (r x0)-—— 
i \_ or o9 sin# 30 


But (r x r) — 0, (r x 6) = 0, and (r x 0) = —9 (see Figure 4.1), and hence 

l = *^±_«_LAV [4 

i yd0 sin 9d<t>) 


The unit vectors 9 and 0 can be resolved into their Cartesian components: 

9 = (cos 9 cos 4>)i + (cos 9 sin 0) j — (sin 9)k\ [4.125] 

0 = — (sin0)f+ (cos 0)/. [4.126] 


L = t f"(— sin 4>i + cos 07) -— — (cos 9 cos 0f + cos 0 sin 0/ — sin 9k) 


n ( . ± 3 ± 3 

L * = j{- Sm4, 30- C ° S4,C ° l % 


h ( 3 3 

Ly = T T cos 0 —— - sin 0 cot # — 
i \ 09 0q> 


n 3 

Lz — ~ TT * f 

i 30 

We shall also need the raising and lowering operators: 

h r 3 3 

T± = L x ± iL y — - (— sin0 ± i cos0)--(cos0 ± i sin 0) cot 9 — 

i |_ o9 30 


But cos 0 ± / sin 0 — e ±l ^, so 


L± = ±he ±i4> [ — ±i cot 9 


We are now in a position to determine f™(9, 0) (I’ll drop the subscript and 
superscript for now). It’s an eigenfunction of L z , with eigenvalue hm: 

„ hBf 

L zf = t 7T =hmf, 
i 0(p 


f = g(9)e in 
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[Here g(9) is a constant of integration, as far as 0 is concerned, but it can still depend 
on #.] And / is also an eigenfunction of L 2 (which we’ll write in terms of L± and 
L z , using Equation 4.112), with eigenvalue h 2 l(l + 1): 


L 2 f=(L + L- + L 2 z -hL z )f 

= h *(i + ,a * e £i) (-*,-*) (g 

= i, 2 i(i + 1 )/. 


i cot 9 


A 

d<t>) 


90 2 i 30 


But in view of Equation 4.131, df/39 = e im ^dgfd9 and 3//30 = ime tm ^g, so 
+i cot9 J^j (e i(m ~ m ) + mgcot#^ + m 2 ge im(p - mge im<p 

= (S + mgco,e ) + - 1 > cote (| +m ? cote ) 

+m(m - l)g] = /(/ + 1 )ge ,m4> . 

Canceling e im<i> , 

| - ra—| cot# + mg esc 2 9 + (m - l)cot #^| +m(w - 1)(1 +cot 2 9)g 


= -^f - cot 0 +w 2 g CSC 2 9 = l(l 4- l)g, 
or, multiplying through by — sin 2 9 : 

sin 2 ^^-| + sin# cos#^ — m 2 g =—/(/ + 1) sin 2 #g. 
du 1 du 

This is a differential equation for g(#); it can be written in a more familiar form: 

sin 9 ^ ^sin 9 ^ + [/(/ + 1) sin 2 9 - m 2 ]g = 0. [4.132] 

But this is precisely the equation for the ^-dependent part, ©(#), of Y™ (9, 0) (compare 
Equation 4.25). Meanwhile, the ^-dependent part of / (to wit, is identical 
to 0(0) (Equation 4.22). Conclusion: The spherical harmonics are precisely the 
(normalized) eigenfunctions of L 2 and L z . 

When we solved the Schrodinger equation by separation of variables, in Section 
4.1, we were inadvertantly constructing simultaneous eigenfunctions of the three 
commuting operators H, L z , and L z : 
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Hf = Eijr, L 2 f=Ti 2 l(l + 1)0, L z x{r=hmf. [4.133] 

But there is a curious twist to this story, for the algebraic theory of angular momentum 
permits / (and hence also m) to take on ^//-integer values (Equation 4.119), whereas 
the analytic method yielded eigenfunctions only for integer values (Equation 4.29). 
You might reasonably guess that the half-integer solutions are spurious, but it turns 
out that they are of profound importance, as we shall see in the following sections. 

♦Problem 4.22 

(a) What is L + Yfl (No calculation allowed!) 

(b) Use the result of (a), together with the fact that L z Yf = hlYj, to determine 
Yf (6, 0), up to a normalization constant. 

(c) Determine the normalization constant by direct integration. Compare your final 
answer to what you got in Problem 4.5. 

Problem 4.23 In Problem 4.3 you showed that 

Yl(Q,<j>) — —15/8jr sin 9 cos 0e l<l> . 

Apply the raising operator to find Y\(Q, 0). Use Equation 4.121 to get the normal¬ 
ization. 

Problem 4.24 


(a) Prove that the spherical harmonics are orthogonal (Equation 4.33). Hint. This 
requires no calculation, if you invoke the appropriate theorem. 

(b) Prove the orthogonality of the hydrogen wave functions 0„/ m (r, 6, 0) (Equa¬ 
tion 4.90). 

Problem 4.25 Two particles of mass m are attached to the ends of a massless rigid 
rod of length a. The system is free to rotate in three dimensions about the center (but 
the center point itself is fixed). 


(a) Show that the allowed energies of this rigid rotor are 


E r 


h 2 n{n + 1) 
ma 2 


for n= 0,1,2,... 


Hint: First express the (classical) energy in terms of the total angular momentum. 

(b) What are the normalized eigenfunctions for this system? What is the degeneracy 
of the nth energy level? 
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4.4 SPIN_ 

In classical mechanics, a rigid object admits two kinds of angular momentum: orbital 
(L = r x p), associated with the motion of the center of mass, and spin (S = lea). 
associated with motion about the center of mass. For example, the earth has orbital 
angular momentum attributable to its annual revolution around the sun, and spin 
angular momentum coming from its daily rotation about the north-south axis. In 
the classical context this distinction is largely a matter of convenience, for when 
you come right down to it, S is nothing but the sum total of the “orbital” angular 
momenta of all the rocks and dirt clods that go to make up the earth, as they circle 
around the axis. But an analogous thing happens in quantum mechanics, and here 
the distinction is absolutely fundamental. In addition to orbital angular momentum, 
associated (in the case of hydrogen) with the motion of the electron around the nucleus 
(and described by the spherical harmonics), the electron also carries another form 
of angular momentum, which has nothing to do with motion in space (and which is 
not, therefore, described by any function of the position variables r,9,<f>) but which 
is somewhat analogous to classical spin (and for which, therefore, we use the same 
word). It doesn’t pay to press this analogy too far: The electron (as far as we know) is 
a structureless point particle, and its spin angular momentum cannot be decomposed 
into orbital angular momenta of constituent parts (see Problem 4.26). 21 Suffice it to 
say that elementary particles carry intrinsic angular momentum (S) in addition to 
their “extrinsic” angular momentum (L). 

The algebraic theory of spin is a carbon copy of the theory of orbital angular 
momentum, beginning with the fundamental commutation relations 22 : 

[S x ,S y ] = ihS z , [S y ,S z ] = ihS x , [S z , S x ] = ihS y . [4.134] 

It follows (as before) that the eigenvectors of S 2 and S z satisfy 23 

S 2 \sm) = h 2 s(s + l)|sm); S z \sm) = hm\s m); [4.135] 

and ^__ 

S±\sm) = hy/s(s + 1) - m(m ± 1) |s (m ± 1)), [4.136] 


2 Tor a contrary interpretation, see Hans C. Ohanian, “What is Spin?’’, Am. J. Phys. 54 , 500 (1986). 

22 We shall take these as postulates for the theory of spin; the analogous formulas for orbital angular 
momentum (Equations 4.98 and 4.99) were derived from the known form of the operators (Equation 4.97). 
In a more sophisticated treatment they can both be obtained from the rotational invariance of the three- 
dimensional world [see, for example, Leslie E. Ballentine, Quantum Mechanics (Englewood Cliffs, NJ: 
Prentice Hall, 1990), Section 3.3], Indeed, these fundamental commutation relations apply to all forms 
of angular momentum, whether spin, orbital, or the combined angular momentum of a composite system, 
which could include some spin and some orbital. 

23 Because the eigenstates of spin are not junctions, I revert to the “ket” notation for them. (I could 
have done the same in Section 4.3, writing |/ m) in place of Yj , but in that context the function notation 
seems more natural.) By the way, I’m running out of letters, so I’ll use m for the eigenvalue of S z , just as 
1 did for L z (some authors write mi and m s at this stage, just to be absolutely clear). 
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where S± = S x ± iS y . But this time the eigenvectors are not spherical harmonics 
(they’re not functions of 6 and <j> at all), and there is no a priori reason to exclude the 
half-integer values of s and m: 

1 3 

s — 0, —, 1, m = —s, -s + 1, ..., s — 1, s. [4.137] 

It so happens that every elementary particle has a specific and immutable value 
of s, which we call the spin of that particular species: pi mesons have spin 0; electrons 
have spin 1/2; photons have spin 1; deltas have spin 3/2; gravitons have spin 2; and so 
on. By contrast, the orbital angular momentum quantum number l (for an electron in a 
hydrogen atom, say) can take on any (integer) value you please, and will change from 
one to another when the system is perturbed. But s is fixed, for any given particle, 
and this makes the theory of spin comparatively simple. 24 


Problem 4.26 If the electron is a classical solid sphere, with radius 


4n<Eomc 2 ’ 


[4.138] 


(the so-called classical electron radius, obtained by assuming that the electron’s 
mass is attributable to energy stored in its electric field, via the Einstein formula 
E = me 2 ), and its angular momentum is (l/2)7z, then how fast (in m/s) is a point on 
the “equator” moving? Does this model for spin make sense? (Actually, the radius 
of the electron is known experimentally to be much less than r c , but this only makes 
matters worse.) 


4.4.1 Spin 1/2 

By far the most important case is s = 1/2, for this is the spin of the particles that 
make up ordinary matter (protons, neutrons, and electrons), as well as all quarks and 
all leptons. Moreover, once you understand spin 1/2, it is a simple matter to work 
out the formalism for any higher spin. There are just two eigenstates: \\\), which 
we call spin up (informally, t), and (— ^)), which we call spin down (|). Using 
these as basis vectors, the general state of a spin-1/2 particle can be expressed as a 
two-element column matrix (or spinor): 

X = ( a b j=a x+ + b x -, [4.139] 


24 Indeed, in a mathematical sense, spin 1/2 is the simplest possible nontrivial quantum system, for 
it admits just two possible states. In place of an infinite-dimensional Hilbert space, with all its subtleties 
and complications, we find ourselves working in an ordinary two-dimensional vector space; in place of 
unfamiliar differential equations and fancy functions, we are confronted with 2x2 matrices and two- 
component vectors. For this reason, some authors begin quantum mechanics with a treatment of the 
spin-1/2 system. But the price of mathematical simplicity is conceptual abstraction, and I prefer not to do 
it that way. 
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with 


representing spin up, and 


-W 

-(!) 


[4.140] 


[4.141] 


for spin down. Meanwhile, the spin operators become 2x2 matrices, which we can 
work out by noting their effect on x+ and Equation 4.135 says 


S 2 X+ = s2 X- = \ nl X-' SzX + = \ x+; SzX ~ 

and Equation 4.136 gives 

S + x- - hx+\ S-X+ = kX-\ S+X+ = S-X- = 0. 
Now, S± = S x ± i S y , so 

S, = 1(S + + X.) and 5, = 2(S + _S_), 

and it follows that 


r*-; [4-142] 


[4.143] 


[4.144] 


Sxx+ = \x~\ s x x- = ^x+-, s y x+ = s y x ~ = Yi x+ ' [4-145] 


Thus 


while 




It’s a little tidier to divide off the factor of ft/2: S = (ft/2)er, where 

»-(:;)->-(? v>-(i *.)• 

These are the famous Pauli spin matrices. Notice that S x , S y , S z , and S 2 are all 
Hermitian (as they should be, since they represent observables). On the other hand, 
S + and S- are not Hermitian—evidently they are not observable. 

The eigenspinors of S z are (of course) 


[4.146] 

[4.147] 

[4.148] 


X+ : 


W fei 


(eigenvalue + -); x~ : 


■(!)• 


(eigenvalue - -). [4.149] 
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If you measure S z on a particle in the general state x (Equation 4.139), you could get 
+h/2, with probability \a\ 2 , or —hj 2, with probability \b\ 2 . Since these are the only 
possibilities, 

|a| 2 + |*| 2 =l [4.150] 

(i.e., the spinor must be normalized). 25 

But what if, instead, you chose to measure S x ? What are the possible results, 
and what are their respective probabilities? According to the generalized statisti¬ 
cal interpretation, we need to know the eigenvalues and eigenspinors of S x . The 
characteristic equation is 


—A 

h/2 


n /2 


=o = 




Not surprisingly, the possible values for S x are the same as those for S z . The eigen¬ 
spinors are obtained in the usual way: 


'(::)(;)-4 

so fi = ±a. Evidently the (normalized) eigenspinors of S x are 

X+ ) = { ^ Y (eigenvalue + -); X- ] = ( Y (eigenvalue — ^). [4.151] 
2 2 

As the eigenvectors of a Hermitian matrix, they span the space; the generic spinor x 
can be expressed as a linear combination of them: 

x = ’ + (a/5") X “ *' [41521 

If you measure S x , the probability of getting +h/2 is (1/2) | a + b\ 2 , and the prob¬ 
ability of getting — hj2 is (\/2)\a — b\ 2 . (You should check for yourself that these 
probabilities add up to 1.) 


Example. Suppose a spin 1/2 particle is in the state 



If you measure S z , the probability of getting +Ti/2 is |(1 + /')/V6| 2 = 1/3, and the 
probability of getting —h/2 is |2/>/6| 2 = 2/3. If you measure S x , the probability of 


25 PeopIe often say that \a\ 2 is the “probability that the particle is in the spin-up state”, but this 
is sloppy language; the particle is in state x — not X i —and what the speaker really means is that if you 
measured S z , \a\ 2 is the probability you’d get h /2, which is an entirely different assertion. 
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getting +h/2 is (1/2) |(3 + i)/V6\ 2 = 5/6, and the probability of getting —ft/2 is 
(1/2)|(— 1 + /)/V6| 2 = 1/6. Evidently the expectation value of S x is 


5 

6 


6D*i(-i) 


ft_ 

3’ 


which we could also have obtained more directly: 




vs) (m 


n /2 \ ( (l + o/Ve \ 
0 / V 2/V6 ) 


h 

3' 


I’d like now to walk you through an imaginary measurement scenario involving 
spin 1/2, because it serves to illustrate in very concrete terms some of the abstract ideas 
we discussed back in Chapter 1 . Let’s say we start out with a particle in the state x+ ■ If 
someone asks, “What is the z-component of that particle’s spin angular momentum?", 
we could answer unambiguously: +ft /2. For a measurement of S z is certain to return 
that value. But if our interrogator asks instead, “What is the x-component of that 
particle’s spin angular momentum?”, we are obliged to equivocate: If you measure 
S x , the chances are 50-50 of getting either ft/2 or -ft(2. If the questioner is a 
classical physicist, or a “realist” (in the sense of Section 1.2), he will regard this as an 
inadequate—not to say impertinent—response: “Are you telling me that you don 7 
know the true state of that particle?” On the contrary; I know precisely what the state 
of the particle is: x+- “Well, then, how come you can’t tell me what the x-component 
of its spin is?” Because it simply does not have a particular x-component of spin. 
Indeed, it cannot , for if both S x and S z were well defined, the uncertainty principle 
would be violated. 

At this point our challenger grabs the test tube and measures the x-component 
of its spin; let’s say he gets the value +ft/2. “Aha!” (he shouts in triumph), “You 
lied\ This particle has a perfectly well-defined value of S x : It’s ft/2V Well, sure—it 
does now , but that doesn’t prove it had that value, prior to your measurement. “You 
have obviously been reduced to splitting hairs. And anyway, what happened to your 
uncertainty principle? I now know both S x and S z .” I’m sorry, but you do not: In 
the course of your measurement, you altered the particle’s state; it is now in the state 
X+*, and whereas you know the value of S x , you no longer know the value of S-. 
“But I was extremely careful not to disturb the particle when I measured 5/.” 26 Very 
well, if you don’t believe me, check it out: Measure S z , and see what you get. (Of 
course, he may get +ft /2, which will be embarrassing to my case—but if we repeat 
this whole scenario over and over, half the time he will get —ft/2.) 

To the layperson, the philosopher, or the classical physicist, a statement of 
the form “this particle doesn’t have a well-defined position” (or momentum, or 


26 Neils Bohr was anxious to track down the mechanism by which the measurement of S x inevitably 
destroys the value of S z , in gedanken experiments of this sort. His famous debates with Einstein include 
many delightful examples, showing in detail how experimental constraints serve to enforce the uncertainty 
principle. 
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x -component of spin angular momentum, or whatever) sounds vague, incompetent, 
or (worst of all) profound. It is none of these. But its precise meaning is, I think, 
almost impossible to convey to anyone who has not studied quantum mechanics in 
some depth. If you find your own comprehension slipping, from time to time (if 
you don’t, you probably haven’t understood the problem), come back to the spin-1/2 
system: It is the simplest and cleanest context for thinking through the conceptual 
paradoxes of quantum mechanics. 

Problem 4,27 

(a) Check that the spin matrices (Equation 4.147) obey the fundamental commuta¬ 
tion relation for angular momentum: [ S x , S y ] = itiS z . 

(b) Show that the Pauli spin matrices satisfy 

<TjO k = S jk + i € jki ff i , [4.153] 

i 

where the indices stand for x, y, or z, and is the Levi-Cirita symbol: + 1 if 
jkl = 123,231, or 312; -1 if jkl = 132,213, or 321; 0 otherwise. 


❖Problem 4.28 An electron is in the spin state 

* = a( 3 ;). 

(a) Determine the normalization constant A. 

(b) Find the expectation values of S x , S y , and S-. 

(c) Find the “uncertainties” er^, ay v , and cry.. 

<d) Confirm that your results are consistent with all three uncertainty principles 
(Equation 4.100 and its cyclic permutations—only with S in place of L, of 
course). 


❖Problem 4.29 For the most general normalized spinor x (Equation 4.139), com¬ 
pute {S x ), (S y ), (S z ), (Si), (SJ), and (S 2 ). Check that <S 2 ) + (S*) + (S' 2 ) = <S 2 ). 

❖Problem 4.30 

(a) Find the eigenvalues and eigenspinors of S y . 

(b) If you measured S y on a particle in the general state x (Equation 4.139), what 
values might you get, and what is the probability of each? Check that the 
probabilities add up to 1. 

(c) If you measured S?, what values might you get and with what probability? 
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** Problem 4.3 1 Construct the matrix S r representing the component of spin angular 
momentum along an arbitrary direction f. Use spherical coordinates, so that 

r — sin0 cos (j>i + sin# sintj) j + cos 9 k. [4.154] 

Find the eigenvalues and (normalized) eigenspinors of S r . Answer: 

(r)_( cos(#/2) \ Y (C = f sin(#/2) \ 4 5 ] 

X + -^$ 111 ( 0 / 2 ))’ X ~ y —e‘^ cos(#/2 ))' L ‘ J 

Problem 4.32 Construct the spin matrices (S x , S y , and S z ) for a particle of spin 1. 
Hint: How many eigenstates of S z are there? Determine the action of S z , S+, and S- 
on each of these states. Follow the procedure used in the text for spin 1/2. 


4.4.2 Electron in a Magnetic Field 

A spinning charged particle constitutes a magnetic dipole. Its magnetic dipole 
moment fi is proportional to its spin angular momentum S: 

fi = yS ; [4.156] 

the proportionality constant y is called the gyromagnetic ratio. 27 When a magnetic 
dipole is placed in a magnetic field B, it experiences a torque, |txB, which tends 
to line it up parallel to the field (just like a compass needle). The energy associated 
with this torque is 28 

H = [4.157] 

so the Hamiltonian of a spinning charged particle, at rest 29 in a magnetic field B, 
becomes 

H — —yB ■ S, [4.158] 

where S is the appropriate spin matrix (Equation 4.147, in the case of spin 1/2). 

Example: Larmor precession. Imagine a particle of spin 1/2 at rest in a 
uniform magnetic field, which points in the z-direction: 

B = BqL [4.159] 


27 See, for example, D. Griffiths, Introduction to Electrodynamics, 2nd ed, (Englewood Cliffs, NJ: 
Prentice Hall, 1986), page 239. Classically, the gyromagnetic ratio of a rigid object is q/2m, where q is 
its charge and m is its mass. For reasons that are fully explained only in relativistic quantum theory, the 
gyromagnetic ratio of the electron is almost exactly twice the classical value. 

2S Griffiths, (footnote 27), pages 246 and 268. 

29 If the particle is allowed to move, there will also be kinetic energy to consider; moreover, it will 
be subject to the Lorentz force (qv x B), which is not derivable from a potential energy function and hence 
does not fit the Schrodinger equation as we have formulated it so far. I’ll show you later on how to handle 
this problem, but for the moment let’s just assume that the particle is free to rotate, but otherwise stationary. 
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(; •.)■ 


[4.160] 


[4.161] 


The Hamiltonian matrix is 

H = - Y B 0 S 2 = 

The eigenstates of H are the same as those of S z : 

X+, with energy E+ = -(yB 0 ti)/2, 

X~, with energy E _ = +(yB 0 h)/2. 

Evidently, the energy is lowest when the dipole moment is parallel to the field—just 
as it would be classically. 

Since the Hamiltonian is time independent, the general solution to the time- 
dependent Schrodinger equation, 

ih ! E = Hx, [4.162] 

can be expressed in terms of the stationary states: 

X (,)=a x+ e-‘M+b X -e-*-» = (£?£)■ 

The constants a and b are determined by the initial conditions; say 

where \a\ 2 + \b\ 2 = 1. With no essential loss of generality 30 I’ll write a = cos(a/2) 
and b = sin(a/2), where a is a fixed angle whose physical significance will appear 
in a moment. Thus 

I 4 - 1 ®] 

To get a feel for what is happening here, let’s calculate the expectation value of 
the spin <S) as a function of time: 

{S x ) = x(.t)'s,x(. 0 

= (cos(a/2)e~' yBol t 2 %m(ct/2)e‘ y!t,l '' /2 ) \ 


^0 1^ ^ cos(ar/2)e' K5 ° f/2 ^ 


2 

Similarly, 


sin a cos(y5o0- 


sin(a/2)e ‘y^/ 2 J 

[4.164] 


{Sy) = x(t) *SyX(t) = --sinasin(yB 0 t), 


[4.165] 


30 This does assume that a and b are real ; you can work out the general case if you like, but all it 
does is add a constant to t. 
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Figure 4.7: Precession of (S) in a uniform 
magnetic field. 


and 

(S z ) =x(0 t ‘S'zX(0 = ~ cos a. [4.166] 

Evidently (S) is tilted at a constant angle a to the z-axis, and precesses about the field 

at the Larmor frequency 

o.) — yBo, [4.167] 

just as it would classically 31 (see Figure 4.7). No surprise here—Ehrenfest’s theorem 
(in the form derived in Problem 4.21) guarantees that (S) evolves according to the 
classical laws. But it’s nice to see how this works out in a specific context. 


Example: the Stem-Gerlach experiment. In an inhomogeneous magnetic 
field, there is not only a torque, but also a force, on a magnetic dipole 32 : 

F = V(/i-B). [4.168] 

This force can be used to separate out particles with a particular spin orientation, 
as follows. Imagine a beam of relatively heavy neutral atoms, 33 traveling in the 
v-direction, which passes through a region of inhomogeneous magnetic field (Figure 
4.8)—for example, 

B(x, y, z) = — axi + (B 0 + az)k, [4.169] 

where Bq is a strong uniform field and the constant a describes a small deviation 
from homogeneity. (Actually, what we’d like is just the z-component of this field, but 


31 See, for instance, The Feynman Lectures on Physics (Reading, MA: Addison-Wesley, 1964). 
Volume II, Section 34-3. Of course, in the classical case it is the angular momentum vector itself, not just 
its expectation value, that precesses around the magnetic field. 

32 Griffiths, (footnote 27), page 247. Note that F is the negative gradient of the energy (Equation 

4.157). 

33 We make them neutral to avoid the large-scale deflection that would otherwise result from the 
Lorentz force, and heavy so we can construct localized wave packets and treat the motion in terms of 
classical particle trajectories. In practice, the Stem-Gerlach experiment doesn’t work, for example, with a 
beam of free electrons. 



Magnet 

Figure 4.8: The Stern-Gerlach apparatus. 


Spin down 


unfortunately that’s impossible—it would violate the electromagnetic law V • B = 0; 
like it or not, the x-component comes along for the ride.) The force on these atoms is 

F = yct(—S x i + S z k ). 

But because of the Larmor precession about 5 0 > S x oscillates rapidly, and averages 
to zero; the net force is in the z-direction: 

F z = yaS z , [4.170] 

and the beam is deflected up or down, in proportion to the z-component of the spin 
angular momentum. Classically we’d expect a smear, but in fact the beam splits into 
2s + 1 individual beams, beautifully demonstrating the quantization of S z . (If you 
use silver atoms, for example, all the inner electrons are paired in such a way that 
their spin and orbital angular momenta cancel. The net spin is simply that of the 
outermost—unpaired—electron, so in this case 5 = 1/2, and the beam splits in two.) 

That argument was purely classical , up to the final step; “force” has no place in 
a proper quantum calculation, and you might therefore prefer the following approach 
to the same problem. 34 We examine the process from the perspective of a reference 
frame that moves along with the beam. In this frame the Hamiltonian starts out zero, 
turns on for a time T (as the particle passes through the magnet), and then turns off 
again: 

f 0, for t < 0, 

H(t)=\-y(B 0 + az)S z , for0<f<7\ [4.171] 

l 0, for t > T. 

(I ignore the pesky x-component of B, which—for reasons indicated above—is irrel¬ 
evant to the problem.) Suppose the atom has spin 1/2, and starts out in the state 

X(0 = ax++bx~, for t < 0. 

While the Hamiltonian acts, x(t) evolves in the usual way: 

X (t) = a X+ e- iE+t/h +b X -e~ iE - t/h , forO < t < T, 


34 This argument follows L. Ballentine, Quantum Mechanics (Englewood Cliffs, NJ: Prentice Hall, 
1990), page 172. 
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where (from Equation 4.158) 


E± 




[ 4.172 


and hence it emerges in the state 

X(t) = (ae iyTB ^ 2 X+ ) e i(ayT/2)z + (be- iyTBo/2 X -) e - i(ayT/2)z , (t > T ). [ 4.173 


The two terms now carry momentum in the z-direction (see Equation 3.131); the 
spin-up component has momentum 


Pz = 


ayTh 

2 


[ 4.174 


and it moves in the plus-z direction; the spin-down component has the opposite 
momentum, and it moves in the minus-z direction. Thus the beam splits in two, as 
before. (Note that Equation 4.174 is consistent with the earlier result, Equation 4.170. 
for in this case S z =h/ 2 and p z = F Z T.) 

The Stem-Gerlach experiment has played an important role in the philosoph> 
of quantum mechanics, where it serves both as the prototype for the preparation of a 
quantum state and as an illuminating model for a certain kind of quantum measure¬ 
ment. We casually assume that the initial state of a system is known (the Schrodinger 
equation tells us how it subsequently evolves)—but it is natural to wonder how you 
get a system into a particular state in the first place. Well, if you want to prepare a 
beam of atoms in a given spin configuration, you pass an unpolarized beam through 
a Stem-Gerlach magnet and select the outgoing stream you are interested in (closing 
off the others with suitable baffles and shutters). Conversely, if you want to measure 
the z-component of an atom’s spin, you send it through a Stem-Gerlach apparatus and 
record which bin it lands in. I do not claim that this is always the most practical wa\ 
to do the job, but it is conceptually very clean and hence a useful context in which to 
explore the problems of state preparation and measurement. 


Problem 4.33 In the first example (Larmor precession in a uniform magnetic field): 

(a) If you measured the component of spin angular momentum along the x -direction, 
at time t, what is the probability that you would get +h/ 2? 

(b) Same question, but for the y-component. 

(c) Same, but for the z-component. 

** Problem 4.34 An electron is at rest in an oscillating magnetic field 
B = Bq cos (a)t)k, 


where Bq and co are constants. 
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(a) Construct the Hamiltonian matrix for this system. 

(b) The electron starts out (at t - 0) in the spin-up state with respect to the x-axis 
[that is, x(0) = x +*]• Determine x(0 at any subsequent time. Beware: This 
is a time-dependent Hamiltonian, so you cannot get x(t) in the usual way from 
stationary states. Fortunately, in this case you can solve the time-dependent 
Schrodinger equation (Equation 4.162) direcdy. 

(c) Find the probability of getting —h/2 if you measure S x . Answer. 

sin 2 ^n^)^ • 

(d) What is the minimum field (5 0 ) required to force a complete flip in S X 1 


4.4.3 Addition of Angular Momenta 

Suppose now that we have two spin- 1/2 particles—for example, the electron and the 
proton in the ground state 35 of hydrogen. Each can have spin up or spin down, so 
there are four possibilities in all 36 : 

tt, U, 4rt, U. [4.175] 

where the first arrow refers to the electron and the second to the proton. Question: 
What is the total angular momentum of the atom? Let 

S = S (1) +S (2) . [4.176] 

Each of the four composite states is an eigenstate of S z —the z-components simply 
add 

S z x 1 X 2 = (S ( z l) + S z 2) )XiX 2 = (S ( Z 1} X\)X 2 + X^X 2 ) 

= (hmiX\)X2 +Xi(% m 2X2) =h(nti +m 2 )xiX2, 

[note that S (1) acts only on and S (2) acts only on x 2 ]. So m (the quantum number 
for the composite system) is just nt\ + m 2 : 

ff: m = 1; 

m = 0; 

m — 0; 

m — — 1. 


35 I put them in the ground state so there won’t be any orbital angular momentum to worry about. 
36 More precisely, each particle is in a linear combination of spin up and spin down, and the composite 
system is in a linear combination of the four states listed. 
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At first glance, this doesn’t look right: m is supposed to advance in integer 
steps, from —s to + 5 , so it appears that s = 1 —but there is an extra state with m = 0 . 
One way to untangle this problem is to apply the lowering operator S- = + S't' 

to the state ft, using Equation 4.143: 

S_(tt) = (5i n t) t + t (5? t) 

= (s 4.) t +1 (* ±) = »at + to- 

Evidently the three states with s = 1 are (in the notation |,s- m))\ 

(in) = tt 

I |10) = ^(H + It) s = 1 (triplet). [4.177] 

(| 1 - 1 ) = 44 

(As a check, try applying the lowering operator to 110); what should you get? See 
Problem 4.35.) This is called the triplet combination, for the obvious reason. Mean¬ 
while, the orthogonal state with m — 0 carries s = 0 : 

||00> = -f=(tr - rt) J S = 0 (singlet). [4.178] 

(If you apply the raising or lowering operator to this state, you’ll get zero. See Prob¬ 
lem 4.35.) 

I claim, then, that the combination of two spin- 1/2 particles can carry a total spin 
of 1 or 0 , depending on whether they occupy the triplet or the singlet configuration. 
To confirm this, I need to prove that the triplet states are eigenvectors of S 2 with 
eigenvalue 2ft 2 and the singlet is an eigenvector of S 2 with eigenvalue 0. Now 

s 2 = (S (1) + S (2) ) • (S (1) + S {2} ) = (S {1> ) 2 + (S (2) ) 2 + 2 S (1) • S (2) . [4.179] 

Using Equations 4.142 and 4.145, we have 

S (1) ■ S ( 2 , (H) = (S<‘> f)(S< 2) 4) + (S< 1) t)(^, 2> 4) + (S<‘> t)(S? 4) 



ft 2 

= -(2 ft —U). 


Similarly, 

S (1) • S (2) (4f) = ^-(2 f4 - 4t)- 

It follows that 

s a> s (2> |10) = j-k(2 4t - t4 +2 t4 - 4t) = j|10). [4.180] 
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S (,) -S (2, |00) = * * (2 it - tl -2 tl + 4_t) = -^|00). [4.181] 

4 V2 4 

Returning to Equation 4.179 (and again using Equation 4.142), we conclude that 

s 2 |10) = ^- + 2^ 110} = 27t 2 |l 0}, [4.182] 

so 11 0) is indeed an eigenstate of S 2 with eigenvalue 2 ft 2 ; and 

S 2 |00) = - 2^-) |00> = 0, [4.183] 

so 10 0) is an eigenstate of S 2 with eigenvalue 0. (I will leave it for you to confirm that 
11 1) and 11 — 1) are eigenstates of S 2 , with the appropriate eigenvalue—see Prob¬ 
lem 4.35.) 

What we have just done (combining spin 1/2 with spin 1/2 to get spin 1 and 
spin 0) is the simplest example of a larger problem: If you combine spin Si with spin 
s 2 , what total spins s can you get? 37 The answer 38 is that you get every spin from 
(si + s 2 ) down to (5 1 ] - ^ 2 )—or (s 2 - si), if s 2 > si— in integer steps: 

5 = ($i + s 2 ), (si + S 2 - 1), (si + s 2 - 2), ... , |S! - s 2 |. [4.184] 

(Roughly speaking, the highest total spin occurs when the individual spins are aligned 
parallel to one another, and the lowest occurs when they are antiparallel.) For example, 
if you package together a particle of spin 3/2 with a particle of spin 2, you could get a 
total spin of 7/2, 5/2, 3/2, or 1/2, depending on the configuration. Another example: 
If a hydrogen atom is in the state f nlm , the net angular momentum of the electron 
(spin plus orbital) is / + 1/2 or l — 1/2; if you now throw in the spin of the proton, 
the atom’s total angular momentum quantum number is / + 1, /, or l - 1 (and / can 
be achieved in two distinct ways, depending on whether the electron alone is in the 
/ + 1/2 configuration or the / — 1 /2 configuration). 

The particular state | sm) with total spin s and z-component m will be some 
linear combination of the composite states |S] m\) |s 2 mi ): 

\ SM )= C m?m 2 m\Sl m \)\S2 m 2) [4.185] 

(because the z-components add, the only composite states that contribute are those 
for which m i + m 2 = m). Equations 4.177 and 4.178 are special cases of this general 


37 I say spins for simplicity, but either one (or both) could just as well be orbital angular momentum 
(for which, however, we would use the letter /). 

3S For a proof you must look in a more advanced text; see, for instance, Claude Cohen-Tannoudji, 
Bernard Diu, and Franck Laloe, Quantum Mechanics (New York: John Wiley & Sons, 1977), Vol. 2, 
Chapter X. 
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Table 4.7: Clebsch-Gordan coefficients. (A square root sign is understood for 


every entry; the minus sign, if present, goes outside the radical.) 



form, with s\ = S 2 = 1/2 (I used the informal notation f = \\\), |= || (— 5 )))- 
The constants are called Clebsch-Gordan coefficients. A few of the simplest 

cases are listed in Table 4.7 . 39 For example, the shaded column of the 2 x 1 table tells 
us that 

|2 1> = 2=|22)|1-1) + 2=|21)|10> - 4h 20 >|! ')• 

V3 V 6 V2 

In particular, if two particles (of spin 2 and spin 1) are at rest in a box, and the total 
spin is 2 , and its z-component is 1 , then a measurement of S) (1) could return the value 
2h (with probability 1/3), or h (with probability 1/6), or 0 (with probability 1/2). 
Notice that the probabilities add up to 1 (the sum of the squares of any column on the 
Clebsch-Gordan table is 1 ). 

These tables also work the other way around: 

\sim l )\s 2 m 2 ) = [4 186] 

For example, the shaded row in the 3/2 x 1 table tells us that 



39 The general formula is derived in Amo Bohm, Quantum Mechanics: Foundations and Applica¬ 
tions, 2nd ed. (New York: Springer-Verlag, 1986), p. 172. 
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If you put particles of spin 3/2 and spin 1 in the box, and you know that the first has 
mi = 1/2 and the second has m 2 = 0 (so m is necessarily 1/2), and you measured the 
total spin s, you could get 5/2 (with probability 3/5), or 3/2 (with probability 1/15), 
or 1/2 (with probability 1/3). Again, the sum of the probabilities is 1 (the sum of the 
squares of each row on the Clebsch-Gordan table is 1). 

If you think this is starting to sound like mystical numerology, I don’t blame 
you. We will not be using the Clebsch-Gordan tables much in the rest of the book, 
but I wanted you to know where they fit into the scheme of things, in case you 
encounter them later on. In a mathematical sense this is all applied group theory— 
what we are talking about is the decomposition of the direct product of two irreducible 
representations of the rotation group into a direct sum of irreducible representations. 
(You can quote that to impress your friends.) 

♦Problem 4.35 

(a) Apply S- to 110) (Equation 4.177), and confirm that you get V%h\\ —1). 

(b) Apply S± to |00) (Equation 4.178), and confirm that you get zero. 

(c) Show that 11 1) and 11 — 1 > (Equation 4.177) are eigenstates of S 2 , with the 
appropriate eigenvalue. 

Problem 4.36 Quarks carry spin 1/2. Three quarks bind together to make a 
baryon (such as the proton or neutron); two quarks (or more precisely a quark and 
an antiquark) bind together to make a meson (such as the pion or the kaon). Assume 
the quarks are in the ground state (so the orbital angular momentum is zero). 

(a) What spins are possible for baryons? 

(b) What spins are possible for mesons? 

Problem 4.37 

(a) A particle of spin 1 and a particle of spin 2 are at rest in a configuration such 

that the total spin is 3, and its z-component is 1 (that is, the eigenvalue of S z is 

h). If you measured the z-component of the angular momentum of the spin-2 
particle, what values might you get, and what is the probability of each one? 

(b) An electron with spin down is in the state i/^io of the hydrogen atom. If you 
could measure the total angular momentum squared of the electron alone (not 
including the proton spin), what values might you get, and what is the probability 
of each? 


Problem 4.38 Determine the commutator of S 2 with S^ (where S = S (1) + S (2) ). 
Generalize your result to show that 

[S 2 , S (1) ] = 2i7i(S (1) x S (2) ). 


[4.187] 
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Note : Because does not commute with S 2 , we cannot hope to find states that are 
simultaneous eigenvectors of both. To form eigenstates of S 2 , we need linear combi¬ 
nations of eigenstates of S^ 13 . This is precisely what the Clebsch-Gordan coefficient' 
(in Equation 4.185) do for us. On the other hand, it follows by obvious inference fron: 
Equation 4.187 that the sum S (1) + S (2) does commute with S 2 , which only confirm' 
what we already knew (see Equation 4.103). 


FURTHER PROBLEMS FOR CHAPTER 4 _ 

❖Problem 4.39 Consider the three-dimensional harmonic oscillator, for which 
the potential is 

V(r) = ^mco 2 r 2 . [4.188' 


(a) Show that separation of variables in Cartesian coordinates turns this into three 
one-dimensional oscillators, and exploit your knowledge of the latter to deter¬ 
mine the allowed energies. Answer. 

E n = (n + 3/2)ha). [4.189] 

(b) Determine the degeneracy d(n) of E n . 

❖ ❖❖Problem 4.40 Because the three-dimensional harmonic oscillator potential (Equa¬ 
tion 4.188) is spherically symmetric, the Schrodinger equation can be handled b\ 
separation of variables in spherical coordinates as well as Cartesian coordinates. Use 
the power series method to solve the radial equation. Find the recursion formula 
for the coefficients, and determine the allowed energies. Check your answer against 
Equation 4.189. 

❖❖Problem 4.41 

(a) Prove the three-dimensional virial theorem 

2(D = (r-VE) [4.190] 

(for stationary states). Hint'. Refer to Problem 3.53. 

(b) Apply the virial theorem to the case of hydrogen, and show that 

( T) = -E n ; { V)=2E„. [4.191] 

(c) Apply the virial theorem to the three-dimensional harmonic oscillator (Prob¬ 
lem 4.39), and show that in this case 

(T) = {V) = EJ 2. 


[4.192] 
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***Problem 4.42 The momentum-space wave function in three dimensions is de¬ 
fined by the natural generalization of Equation 3.132: 

0(P) S (2dhw / e “" P ‘ r)/ V(r)rf 3 r. [4.193] 

(a) Find the momentum-space wave function for the ground state of hydrogen 
(Equation 4.80). Hint: Use spherical coordinates, setting the polar axis along 
the direction of p. Do the 6 integral first. Answer: 

1 f2a \ 3/2 1 

* (p) -(j) n TFFm- [4194] 

(b) Check that 0 (p) is normalized. 

(C) Use 0(p) to calculate ( p 2 ). 

(d) What is the expectation value of the kinetic energy in this state? Express your 
answer as a multiple of E { , and check that it is consistent with the virial theorem 
(Equation 4.191). 

Problem 4.43 

(a) Construct the spatial wave function (0) for hydrogen in the state n = 3,1 = 2, 
m = 1. Express your answer as a function of r, 0, 0, and a (the Bohr radius) 
only —no other variables (p, z, etc.), or functions (7, v, etc.), or constants (A, 
a 0 , etc.), or derivatives allowed (tt is okay, and e , and 2, etc.). 

(b) Check that this wave function is properly normalized by carrying out the appro¬ 
priate integrals over r, 0, and 0. 

(c) Find the expectation value of r s in this state. For what range of s is the result 
finite? 


***Problem 4.44 Suppose two spin-1/2 particles are known to be in the singlet con¬ 
figuration (Equation 4.178). Let be the component of the spin angular momentum 
of particle number 1 in the direction defined by the unit vector a. Similarly, let S 
be the component of 2’s angular momentum in the direction b. Show that 

t ,2 

(5j 1) 5f ) ) = —- cosfl, [4.195] 

a o 4 


where 0 is the angle between a and b. 

***Problem 4.45 Work out the Clebsch-Gordan coefficients for the case si — 1 /2, 
^2 — anything. Hint: You’re looking for the coefficients A and B in 

\sm) = A\\ i)|$ 2 ( m - \)) + B\\ (—J))|j 2 ( m + ±)), 
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such that j sm) is an eigenstate of S 2 . Use the method of Equations 4.179 through 
4.182. If you can’t figure out what Sj: 2) (for instance) does to \s 2 tn 2 ), refer back n 
Equations 4.136 and 4.144. Use this general result to construct the (1/2) x 1 table 
of Clebsch-Gordan coefficients, and check it against Table 4.7. Answer. 


s 2 ±m + \/2' D _ ± js 2 =F m + 1/2 


2 s 2 + 1 


2S2+ 1 


where the signs are determined by s = s 2 ± 1 /2. 

Problem 446 Find the matrix representing S x for a particle of spin 3/2 (using 
the basis of eigenstates of S z ). Solve the characteristic equation to determine the 
eigenvalues of S x . 

***Problem 447 Work out the normalization factor for the spherical harmonics, a-. 
follows. From Section 4.1.2 we know that 


Y; u = Bf e im ^ Pf (cos 6); 

the problem is to determine the factor Bf (which I quoted, but did not derive, in 
Equation 4.32). Use Equations 4.120, 4.121, and 4.130 to obtain a recursion relation 
giving Bf +l in terms of Bf. Solve it by induction on m to get Bf up to an overall 
constant C (/). Finally, use the result of Problem 4.22 to fix the constant. You may find 
the following formula for the derivative of an associated Legendre function useful: 

(1 -x 2 )/L = yi -x 2 /f +l - mxPP- [4.196] 


Problem 448 The electron in a hydrogen atom occupies the combined spin and 
position state 



(a) If you measured the orbital angular momentum squared (L 2 ), what values might 
you get, and what is the probability of each? 

(b) Same for the z-component of orbital angular momentum ( L z ). 

(c) Same for the spin angular momentum squared (S' 2 ). 

(d) Same for the z-component of spin angular momentum (S z ). 

Let J = L + S be the total angular momentum. 

(e) If you measured J 2 , what values might you get, and what is the probability of 
each? 

(f) Same for J z . 
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(g) If you measured the position of the particle, what is the probability density for 
finding it at r,Q,<j>l 

(h) If you measured both the z-component of the spin and the distance from the ori¬ 
gin (note that these are compatible observables), what is the probability density 
for finding the particle with spin up and at radius r? 

***Problem 4.49 

(a) For a function / {<p) that can be expanded in a Taylor series, show that 

/(^ + 0o) = e' i -' fc/ */W 

(where </>o is any constant angle). For this reason, L z fli is called the generator of 
rotations about the z-axis. Hint: Use Equation 4.129, and refer to Problem 3.55. 

More generally, L • h/h is the generator of rotations about the direction h, in 
the sense that exp(/L ■ n<p/ft) effects a rotation through angle <p (in the right-hand 
sense) about the axis h. In the case of spin, the generator of rotations is S ■ h/Ti. In 
particular, for spin 1/2 

X = e‘^ /2 x [4.197] 

tells us how spinors rotate. 

(b) Construct the (2 x 2) matrix representing rotation by 180° about the x-axis, 
and show that it converts “spin up” (y + ) into “spin down” (y_), as you would 
expect. 

(c) Construct the matrix representing rotation by 90° about the y-axis, and check 
what it does to y + . 

<d) Construct the matrix representing rotation by 360° about the z-axis. If the 
answer is not quite what you expected, discuss its implications. 

(e) Show that 

e i(a-n)<p/i _ cos ^y2) + i(n - <t) sin(<p/2). [4.198] 

** Problem 4.50 The fundamental commutation relations for angular momentum 
(Equations 4.98 and 4.99) allow for half-integer (as well as integer) eigenvalues. But 
for orbital angular momentum only the integer values occur. There must be some 
extra constraint in the specific form L = r x p that excludes half-integer values. 40 
Let a be some convenient constant with the dimensions of length (the Bohr radius, 
say, if we’re talking about hydrogen), and define the operators 

?i = [* + (a 2 /h)p y ]- Pi = j^ [p x - Qi/a 2 )y\- 
qi = ^ [x - ( a 2 /h)p y ]; p 2 = -j= [p x + {h/a 2 )y\ 

40 This problem is based on an argument in Ballentine, (footnote-34), page 127. 
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(a) Verify that [q x , q 2 \ = [p\, p 2 ] = 0; [q i, P\] = [q 2 , Pi] = ifi- Thus the q' s ami 
the p's satisfy the canonical commutation relations for position and momentum, 
and those of index 1 are compatible with those of index 2. 

(b) Show that 



(c) Check that L z = H\ — H 2 , where each H is the Hamiltonian for a harmonic 
oscillator with mass m =Ti ja 2 and frequency co=\. 

(d) We know that the eigenvalues of the harmonic oscillator Hamiltonian are (n ~ 
1/2 )tica, where n = 0,1,2, ... (in the algebraic theory of Section 2.3.1, thb 
follows from the form of the Hamiltonian and the canonical commutation rela¬ 
tions). Use this to conclude that the eigenvalues of L z must be integers. 


***Problem 4.51 In classical electrodynamics the force on a particle of charge q 
moving with velocity v through electric and magnetic fields E and B is given by the 

Lorentz force law: 

F — <?(E + v x B). [4.199] 

This force cannot be expressed as the gradient of a scalar potential energy function, 
and therefore the Schrodinger equation in its original form (Equation 1.1) cannot 
accomodate it. But in the more sophisticated form 

dW 

m — = HV [4.200] 

dt 

there is no problem; the classical Hamiltonian 41 is 

H = -^-(p — <?A) 2 + q(p, [4.201] 


where A is the vector potential (B — V x A) and <p is the scalar potential (E = — V<p - 
dkfdt), so the Schrodinger equation (making the canonical substitution p -»• (i/h)V) 
becomes 


ifi 


~dt 



vk. 


[4.202] 


(a) Show that 

^ = -<(p-?A)). [4.203] 

at m 


41 See, for example, H. Goldstein, Classical Mechanics, 2nd ed., Addison-Wesley, Reading, MA. 
1980, page 346. 
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(b) As always (see Equation 1.32) we identify d{r)/dt with (v). Show that 

m —Jr ~ *7 (E> + A ((P x B — B x p)) — ?-((A X B)). [4.204] 

at 2m m 

(c) In particular, if the fields E and B are uniform over the volume of the wave 
packet, show that 

m ^l=q( E+(v)xB), [4.205] 

at 

so the expectation value of (v) moves according to the Lorentz force law, as we 
would expect from Ehrenfest’s theorem. 


***Problem 4.52 [Refer to Problem 4.51 for background.] Suppose 
A = ^-(xj — yi ), and <p = Kz 2 , 

where Bq and K are constants. 

(a) Find the fields E and B. 

(b) Find the allowed energies, for a particle of mass m and charge q, in these fields. 


**Problem 4.53 [Refer to Problem 4.51 for background.] In classical electrodynam¬ 
ics the potentials A and (p are not uniquely determined 42 ; the physical quantities are 
th e fields, E and B. 

(a) Show that the potentials 

3 A 

(p = <p ——, A' = A + VA [4.206] 

at 

(where A is an arbitrary real function of position and time) yield the same fields 
as <p and A. Equation [4.206] is called a gauge transformation, and the theory 
is said to be gauge invariant. 

(b) In quantum mechanics the potentials play a more direct role, and it is of interest 
to know whether the theory remains gauge invariant. Show that 


= gigA/hy [4.207] 

satisfies the Schrodinger equation [4.202] with the gauge-transformed potentials 
<p' and A'. Since differs from 4/ only by a phase factor, it represents the 


42 See, for example, Griffiths, (footnote 27), section 10.2.4. 
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same physical state 43 , and the theory is gauge invariant (see Section 10.2.4 for 
further discussion). 


43 That is to say, (r), d{r)/dt, etc. are unchanged. Because A depends on position, {p} (with 
p represented by the operator (fc//')V) does change, but as we found in Equation [4.203], p does 
not represent the mechanical momentum (mv) in this context (in Lagrangian mechanics it is so-called 

canonical momentum). 



CHAPTER 5 


IDENTICAL PARTICLES 


5.1 TWO-PARTICLE SYSTEMS _ 

For a single particle, the wave function (r, t) is a function of the spatial coordinates 
r and the time t (we’ll ignore spin for the moment). The wave function for a two- 
particle system is a function of the coordinates of particle one (n), the coordinates 
of particle two (r 2 ), and the time: 


^(ri,r 2 ,r). [5.1] 

Its time evolution is determined (as always) by the Schrodinger equation: 

ih— = HV, [5.2] 

where H is the Hamiltonian for the whole system: 

H = -& 1 V '-& 2 V ' + Vir '^ l) i 53 ' 

(the subscript on V indicates differentiation with respect to the coordinates of particle 
1 or particle 2, as the case may be). The statistical interpretation carries over in the 
obvious way: 

\'t>(r u r 2 ,t)\ 2 d 3 r l d 3 r 2 [5.4] 

is the probability of finding particle 1 in the volume d 3 r 3 and particle 2 in the volume 
d 3 r 2 ; evidently 'T must be normalized in such a way that 

j | 'R(ri, r 2 , t )\ 2 d 3 V\ d 3 r 2 = 1. 


[5.5] 
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For time-independent potentials, we obtain a complete set of solutions by sep¬ 
aration of variables: 

^(ri, r 2 , 0 — iA( r i> r 2 )e~ lEt/fl , [5.6' 

where the spatial wave function (\j /) satisfies the time-independent Schrodinger equa¬ 
tion: 

--—V? iff --—V^i/r + V \j/ = E\j/> [5.7 

2 m, 1 lm 2 ~ 

and E is the total energy of the system. 


**Problem 5.1 Typically, the interaction potential depends only on the vector 
r = Tj - r 2 separating the two particles. In that case the Schrodinger equation 
separates, if we change variables from r,, r 2 to r, R = (m\Y\ + m 2 v 2 )/(m\ + m 2 1 
(the center of mass). 


(a) Show that rj = R + (ti/m \ )r, r 2 ■ 
V 2 = (ii/m i)V/f - V r , where 


= R — {fi/m 2 ) r, and Vi 


e = : 


m\m 2 


m \ +m 2 

is the reduced mass of the system. 

(b) Show that the (time-independent) Schrodinger equation becomes 


: (fi/m 2 )V R + V,. 

[5.8] 


r 


2(Wi + m 2 ) 


V 2 R xj/ - -—+ V (r= Ei{f. 


(C) Solve by separation of variables, letting \j/( R, r) — *jj R (R)i/r r (r). Note that ij/ R 
satisfies the one-particle Schrodinger equation, with the total mass (mi + m 2 1 
in place of m, potential zero, and energy E R , while \jr r satisfies the one-particle 
Schrodinger equation with the reduced mass in place of m, potential F(r), and 
energy E r . The total energy is the sum: E = E R + E r . Note: What this tells 
us is that the center of mass moves like a free particle, and the relative motion 
(that is, the motion of particle 2 with respect to particle 1 ) is the same as if we 
had a single particle with the reduced mass, subject to the potential V . Exactly 
the same separation occurs in classical mechanics 1 ; it reduces the two-bod\ 
problem to an equivalent one-body problem. 


Problem 5.2 In view of Problem 5.1, we can correct for the motion of the nucleus 
in hydrogen by simply replacing the electron mass with the reduced mass: 


(a) Find (to two significant digits) the percent error in the binding energy of hydro¬ 
gen (Equation 4.77) introduced by our use of m instead of p. 


1 See, for example, Jerry Marion, Classical Dynamics, 2nd ed. (New York: Academic Press 1970 1 . 
Section 8.2. 
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(b) Find the separation in wavelength between the red Balmer lines 
(n - 3 n - 2) for hydrogen and deuterium. 

(c) Find the binding energy of positronium (in which the proton is replaced by a 
positron—positrons have the same mass as electrons but opposite charge). 

(d) Suppose you wanted to confirm the existence of muonic hydrogen, in which the 
electron is replaced by a muon (same charge, but 206.77 times heavier). Where 
(i.e., at what wavelength) would you look for the “Lyman-a” line 
(n = 2 n = 1)? 


5.1.1 Bosons and Fermions 

Suppose particle 1 is in the (one-particle) state fr a (j), and particle 2 is in the state 
lM r )- In that case is(r u r 2 ) is a simple product: 

iA( r i,r 2 ) = tMri)iMr 2 ). [5.9] 

Of course, this assumes that we can tell the particles apart—otherwise it wouldn’t 
make any sense to claim that number 1 is in state f a and number 2 is in state ijr b \ all 
we could say is that one of them is in the state and the other is in state f b , but 
we wouldn’t know which is which. If we were talking about classical mechanics this 
would be a silly objection: You can always tell the particles apart, in principle—just 
paint one of them red and the other one blue, or stamp identification numbers on 
them, or hire private detectives to follow them around. But in quantum mechanics 
the situation is fundamentally different: You can’t paint an electron red, or pin a label 
on it, and a detective’s observations will inevitably and unpredictably alter the state, 
raising doubts as to whether the two had perhaps switched places. The fact is, all 
electrons are utterly identical , in a way that no two classical objects can ever be. It is 
not merely that we don’t happen to know which electron is which; God doesn’t know 
which is which, because there is no such thing as “this” electron, or “that” electron; 
all we can legitimately speak about is “an” electron. 

Quantum mechanics neatly accommodates the existence of particles that are 
indistinguishable in principle: We simply construct a wave function that is noncom¬ 
mittal as to which particle is in which state. There are actually two ways to do it: 

iMri, r 2 ) = A[^ a ( r])^(r 2 ) ± ^(n)^ a (r 2 )]. [5.10] 

Thus the theory admits two kinds of identical particles: bosons, for which we use the 
plus sign, and fermions, for which we use the minus sign. Photons and mesons are 
bosons; protons and electrons are fermions. It so happens that 


all particles with integer spin are bosons, and 
all particles with half-integer spin are fermions. 


[5.11] 
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This connection between spin and “statistics” (as we shall see, bosons and fermions 
have quite different statistical properties) can be proved in relativistic quantum me¬ 
chanics; in the nonrelativistic theory it must be taken as an axiom. 

It follows, in particular, that two identical fermions (for example, two electrons) 
cannot occupy the same state. For if = ft, then 

iff— (ri, r 2 ) = A\lff a (Y{)if a <X 2 ) - lMl*l )lM r 2 >] = 0, 

and we are left with no wave function at all. This is the famous Pauli exclusion 
principle. It is not (as you may have been led to believe) a bizarre ad hoc assumption 
applying only to electrons, but rather a consequence of the rules for constructing 
two-particle wave functions, applying to all identical fermions. 

I assumed, for the sake of argument, that one particle was in the state xf/ a and 
the other in state tfr b , but there is a more general (and more sophisticated) way to 
formulate the problem. Let us define the exchange operator P which interchanges 
the two particles: 

Pf(r u r 2 ) = /0*2, rO- [5.12] 

Clearly, P 2 = 1, and it follows (prove it for yourself) that the eigenvalues of P are ± 1 
If the two particles are identical, the Hamiltonian must treat them the same: m j = w; 
and V(ti , r 2 ) = V (r 2 , ri). It follows that P and H are compatible observables, 

[P,H]= 0, [5.13] 


and hence we can find a complete set of functions that are simultaneous eigenstates of 
both. That is to say, we can find solutions to the Schrodinger equation that are either 
symmetric (eigenvalue +1) or antisymmetric (eigenvalue -1) under exchange: 


^(n,r 2 ) = ±iff(r 2 , r0 (+ for bosons, — for fermions). 


[5.14] 


Moreover, if a system starts out in such a state, it will remain in such a state. The new 
law (I’ll call it the symmetrization requirement) is that for identical particles the 
wave function is not merely allowed , but required to satisfy Equation 5.14, with the 
plus sign for bosons and the minus sign for fermions. 2 This is the general statement, 
of which Equation 5.10 is a special case. 


2 It is sometimes suggested that the symmetrization requirement (Equation 5.14) is nothing new— 
that it is forced by the fact that P and H commute. This is false: It is perfectly possible to imagine a system 
of two distinguishable particles (say, an electron and a positron) for which the Hamiltonian is symmetric, 
and yet there is no requirement that the wave function be symmetric (or antisymmetric). But identical 
particles have to occupy symmetric or antisymmetric states, and this is a completely new fundamental 
l aw —on a par, logically, with Schrodinger’s equation and the statistical interpretation. Of course, there 
didn’t have to be any such things as identical particles; it could have been that every single particle in 
nature was clearly distinguishable from every other one. Quantum mechanics allows for the possibility of 
identical particles, and nature (being lazy) seized the opportunity. (But I’m not complaining—this makes 
matters enormously simpler!) 
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Example. Suppose we have two noninteracting 3 particles, both of mass m , in 
the infinite square well (Section 2.2). The one-particle states are 


fn{x) = 



£„ =n 2 K 


(where K = jz 2 ti 2 /2ma 2 ). If the particles are distinguishable, the composite wave 
functions are simple products: 

fn,n 2 {x ],x 2 ) = f nx {x x )ir n2 {x 2 ), E nini = (n 2 + n\)K. 

For example, the ground state is 


2 . 

tyn = - sin(7rx 1 /fl) sin(jrx 2 /u), £"11 = 2 K; 
a 

the first excited state is doubly degenerate: 


^12 = - sin(7rxi/a) sin(27rx 2 /a), £12 = 5K, 

a 

2 

^21 = - sin(2^x 1 /u) sin(^X2/u), £21 = 5£; 

a 

and so on. If the two particles are identical bosons, the ground state is unchanged, 
but the first excited state is nondegenerate: 


— [s\n{nx x /a) sin(2 7tx 2 /a) + sin(2^xi/u) sin(7rx 2 /u)] 

(still with energy 5K). And if the particles are identical fermions, there is no state 
with energy 2 K\ the ground state is 

V2 

— [sin(7rxi/u) sin(2^X2/«) - sin(27TXi/fl) sin(jrx 2 /a)], 
a 

and its energy is 5K. 


❖Problem 5.3 


(a) If tf/ a and ^ are orthogonal, and both normalized, what is the constant A in 
Equation 5.10? 

(b) If \f/ a = i/v, (and it is normalized), what is A? (This case, of course, occurs only 
for bosons.) 

3 They pass right through one another—never mind how you would set this up in practice! I’ll 
ignore spin—if this bothers you (after all, a spinless fermion is a contradiction in terms), assume they’re 
in the same spin state. 
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Problem 5.4 

(a) Write down the Hamiltonian for two identical noninteracting particles in the 
infinite square well. Verify that the fermion ground state given in the example 
is an eigenfunction of H , with the appropriate eigenvalue. 

(b) Find the next two excited states (beyond the ones given in the example)—wave 
functions and energies—for each of the three cases (distinguishable, identical 
bosons, identical fermions). 


5.1.2 Exchange Forces 

To give you some sense of what the symmetrization requirement actually does , I'm 
going to work out a simple one-dimensional example. Suppose one particle is in 
state i/ a (.x), and the other is in state ^(x), and these two states are orthogonal and 
normalized. If the two particles are distinguishable, and number 1 is the one in state 
then the combined wave function is 

\Jr(x u x 2 ) = f a {xi)f b {x 2 )\ [5.15] 

if they are identical bosons, the composite wave function is (see Problem 5.3 for the 
normalization) 


f+(X\,X 2 ) = -J=[fa(X\)fb(X2) + iM* \)fa(x 2 )]\ 


and if they are identical fermions, it is 


1 


f-(x { ,X 2 ) = —^[fa(X\)f b {x 2 ) ~ fb(Xx)f a (X2)l 

V2 


[5.16] 


[5.17] 


Let’s calculate the expectation value of the square of the separation distance 
between the two particles, 


<(*1 - x 2 f) = <x 2 ) + (x 2 ) - 2{x x x 2 ). 


[5.18] 


Case 1: Distinguishable particles. For the wave function in Equation 5.15. 
we have 


(x 2 ) = J x\\^ a {x x )\ 2 dx x J \ir b (x 2 )\ 2 dx 2 = (x 2 ) a 
(the expectation value of x 2 in the one-particle state \fr a ), 

{x\) = j \i/ a (xx)\ 2 dxx J x%\xl/b(x 2 )\ 2 dx 2 - {x 2 ) b , 
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<Xi* 2 ) = / * 1 |^(xi)| 2 dx ] / x 2 \f(x 2 )\ 2 dx 2 = {x) a (x) b . 


In this case, then, 


<(Xi - X 2 Y)d = (X 2 )a + < X 2 ) b ~ 2(x) a (x) b . 


(Incidentally, the answer would—of course—be the same if particle 1 had been in 
state -07,, and particle 2 in state y a .) 

Case 2: Identical particles. For the wave functions in Equations 5.16 and 


x^\ir a (xi)\ 2 dxi /|^(X2)| 2 .X2 
+ J x 2 \f b (xi)\ 2 dx { j \^ a (x 2 )\ 2 dx 2 
± j X 2 Tj/ a (Xx )*^ b {x x )dx x j ^ b (x 2 y^a(X2) dx 2 

± jx\if b {xiTif a {x ] )dx\ f M x 2 y M x 2) dx 2 ] 


-[{X 2 )a + (. X 2 ) b ± 0 ± 0] = - (<X 2 ) a + (X 2 ) b ) . 


(x 2 ) = - {(X 2 ) b + <x 2 } a ). 

(Naturally, <xf) = (x 2 ), since you can’t tell them apart.) But 


*2) = X]\ijf a (xi)| 2 dx\ J x 2 \f b {x 2 )\ 2 dx 2 

+ /xiltMx,)! 2 ^ j x 2 \xj/ a (x 2 )\ 2 dx 2 
± j x x y a {x x )*f b {x x )dx l j x 2 f b (x 2 )* f a (x 2 ) dx 2 
± j X\ij/ b (xi)* \j/ a (xi) dx\ j x 2 \l/ a (x 2 )*\l/ b (x 2 )dx 2 ^ 
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= l;{(x)a(x)b + {x)b(x)a ± (x)ab{x)ba ± {x)ba{x)ab) 

= {X) a {x)b±\{x) ab \ 2 , 


where 


Evidently 


{x)ab — 


J xf a (x)*f b (x)dx. 


[5.20 


{(*1 - X 2 ) 2 )± = { X 2 ) a + {x 2 )b ~ 2{x) a {x) b 2|<*U| 2 . [5.21 

Comparing Equations 5.19 and 5.21, we see that the difference resides in the 
final term: 


<(Ax) 2 }± = ((Ax) 2 ),, =F 2|(xU| 2 ; [5.22 

identical bosons (the upper signs) tend to be somewhat closer together, and identical 
fermions (the lower signs) somewhat farther apart, than distinguishable particles ir. 
the same two states. Notice that {x) a b vanishes unless the two wave functions actually 
overlap [if ^r a {x) is zero wherever i/r 6 (jc) is nonzero, the integral in Equation 5.20 F 
itself zero]. So if represents an electron in an atom in Chicago and ifr b represent' 
an electron in an atom in Seattle, it’s not going to make any difference whether you 
antisymmetrize the wave function or not. As a practical matter, therefore, it’s oka> 
to pretend that electrons with nonoverlapping wave functions are distinguishable 
(Indeed, this is the only thing that allows physicists and chemists to proceed at all. 
for in principle every electron in the universe is linked to every other one via the 
antisymmetrization of their wave functions, and if this really mattered, you wouldn't 
be able to talk about any one electron until you were prepared to deal with them all'. • 
The interesting case is when there is some overlap of the wave functions. The 
system behaves as though there were a “force of attraction” between identical bosons, 
pulling them closer together, and a “force of repulsion” between identical fermions, 
pushing them apart. We call it an exchange force, although it’s not really a force at 
all—no physical agency is pushing on the particles; rather, it is a purely geometrical 
consequence of the symmetrization requirement. It is also a strictly quantum me¬ 
chanical phenomenon, with no classical counterpart. Nevertheless, it has profound 
consequences. Consider, for example, the hydrogen molecule (H 2 ). Roughly speak¬ 
ing, the ground state consists of one electron in the atomic ground state (Equation 
4.80) centered on nucleus 1, and one electron in the atomic ground state centered at 
nucleus 2. If electrons were bosons, the symmetrization requirement (or, if you like, 
the “exchange force”) would tend to concentrate the electrons toward the middle, 
between the two protons (Figure 5.1a), and the resulting accumulation of negative 
charge would attract the protons inward, accounting for the covalent bond that holds 
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(a) (b) 


Figure 5.1: Schematic picture of the covalent bond: (a) Symmetric configura¬ 
tion produces attractive force; (b) antisymmetric configuration produces repul¬ 
sive force. 


the molecule together. Unfortunately, electrons aren’t bosons, they’re fermions, and 
this means that the concentration of negative charge should actually be shifted to the 
wings (Figure 5.1b), tearing the molecule apart! 

But wait. We have been ignoring spin. The complete state of the electron in¬ 
cludes not only its position wave function, but also a spinor, describing the orientation 
of its spin 4 : 


VKr)x(s). [5.23] 

When we put together the two-electron state, it is the whole works, not just the 
spatial part, that has to be antisymmetric with respect to exchange. Now, a glance 
back at the composite spin states (Equations 4.177 and 4.178) reveals that the singlet 
combination is antisymmetric (and hence would have to be joined with a symmetric 
spatial function), whereas the three triplet states are all symmetric (and would require 
an antisymmetric spatial function). Evidently, then, the singlet state should lead to 
bonding, and the triplet to twfi'bonding. Sure enough, the chemists tell us that covalent 
bonding requires the two electrons to occupy the singlet state, with total spin zero. 5 

♦Problem 5.5 Imagine two noninteracting particles, each of mass m, in the infinite 
square well. If one is in the state (Equation 2.24) and the other in state 
orthogonal to f n , calculate ((xi - x 2 ) 2 ), assuming that (a) they are distinguishable 
particles, (b) they are identical bosons, and (c) they are identical fermions. 

Problem 5.6 Suppose you had three particles, one in state ^ a (x), one in state ^(x), 
and one in state ^ c (x). Assuming that i// a , \jr b , and i/v are orthonormal, construct 
the three-particle states (analogous to Equations 5.15, 5.16, and 5.17) representing 
(a) distinguishable particles, (b) identical bosons, and (c) identical fermions. Keep in 
mind that (b) must be completely symmetric under interchange of any pair of particles, 
and (c) must be completely anti-symmetric in the same sense.) Note: There’s a cute 


4 In the absence of coupling between spin and position, we are free to assume that the state is 
separable in its spin and spatial coordinates. This just says that the probability of getting spin up is 
independent of the location of the particle. In the presence of coupling, the general state would take the 
form of a linear combination: xk+(r)x+ + V'-lUX-- 

5 In casual language, it is often said that the electrons are “oppositely aligned” (one with spin up, 
and the other with spin down). This is something of an oversimplification, since the same could be said of 
the m = 0 triplet state. The precise statement is that they are in the singlet configuration. 
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trick for constructing completely antisymmetric wave functions: Form the Slater 
determinant, whose first row is iAa(*i), i^b( x i), i^c( x 1 ), etc., whose second row E 
^ai x i), ^b( x 2 ), ^ c (x 2 ), etc., and so on (this device works for any number of particles). 


5.2 ATOMS 


A neutral atom, of atomic number Z, consists of a heavy nucleus, with electric charge 
Ze , surrounded by Z electrons (mass m and charge —e). The Hamiltonian for thE 
system is 6 


H = 



2m 


1 ( 1 e2 

\4tz€o) rj + 2 y 4tt€q J j^ k |r y - r*j 


[5.24] 


The term in curly brackets represents the kinetic plus potential energy of the yth 
electron in the electric field of the nucleus; the second sum (which runs over all 
values of j and k except j = k) is the potential energy associated with the mutual 
repulsion of the electrons (the factor of 1/2 in front corrects for the fact that the 
summation counts each pair twice). The problem is to solve Schrodinger’s equation. 


Hf = Ef, 


[5.25] 


for the wave function ^(ri, r 2 ,..., r z ). Because electrons are identical fermions, 
however, not all solutions are acceptable: only those for which the complete state 
(position and spin), 


^(r!,r 2 , ...,r z )x(s 1 ,s 2 , ...,s z ), [5.26] 

is antisymmetric with respect to interchange of any two electrons. In particular, no 
two electrons can occupy the same state. 

Unfortunately, the Schrodinger equation with the Hamiltonian in Equation 5.24 
cannot be solved exactly (at any rate, it hasn ’t been) except for the very simplest case. 
Z = 1 (hydrogen). In practice, one must resort to elaborate approximation methods. 
Some of these we shall explore in Part II; for now I plan only to sketch some of the 
qualitative features of the solutions, obtained by neglecting the electron repulsion 
term altogether. In section 5.2.1 we’ll study the ground state and excited states of 
helium, and in section 5.2.2 we’ll examine the ground states of higher atoms. 


6 I’m assuming the nucleus is stationary. The trick of accounting for nuclear motion by using the 
reduced mass (Problem 5.1) works only for the two -body problem—hydrogen; fortunately, the nucleus is so 
much more massive than the electrons that the correction is extremely small even in that case (see Problem 
5.2a), and it is smaller still for the heavier atoms. There are more interesting effects, due to magnetic 
interactions associated with electron spin, relativistic corrections, and the finite size of the nucleus. We’ll 
look into these in later chapters, but all of them are minute corrections to the “purely Coulombic” atom 
described by Equation 5.24. 
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Problem 5.7 Suppose you could find a solution i/r(ri, r 2 , ..., r z ) to the 
Schrodinger (Equation 5.25) for the Hamiltonian in Equation 5.24. Describe how 
you could construct from it a completely symmetric function and a completely an¬ 
tisymmetric function, which also satisfy the Schrodinger equation, with the same 
energy._ 


5.2.1 Helium 

After hydrogen, the simplest atom is helium (Z = 2). The Hamiltonian, 

h 2 2 1 2e 2 ] 1 e 2 

+ 4 ^ [5 ' 27! 

consists of two hydrogenic Hamiltonians (with nuclear charge 2e), one for electron 
1 and one for electron 2, together with a final term describing the repulsion of the 
two electrons. It is this last term that causes all the problems. If we simply ignore it, 
the Schrodinger equation separates, and the solutions can be written as products of 
hydrogen wave functions: 


2 m 4tt6o 


r 1? r 2 ) = [5.28] 

only with half the Bohr radius (Equation 4.72), and four times the Bohr energies 
(Equation 4.70). The total energy would be 

E = 4 (E„ + E„0, [5.29] 

where E n = —13.6 fn 2 eV. In particular, the ground state would be 

^o(ri, r 2 ) = T/ f ioo( r i)t/ r ioo( r 2 ) = -^-e~ 1(ri+r2)/a [5.30] 

7ta i 

(see Equation 4.80), and its energy would be 

Eq = 8(—13.6eV) = —109eV. [5.31] 

Because xJ/q is a symmetric function, the spin state has to be antisymmetric, so the 
ground state of helium is a singlet configuration, with the spins “oppositely aligned”. 
The actual ground state of helium is indeed a singlet, but the experimentally deter¬ 
mined energy is —78.975 eV, so the agreement is not very good. But this is hardly 
surprising: We ignored electron repulsion, which is certainly not a small contribution. 
It is clearly positive (see Equation 5.27), which is comforting—evidently it brings the 
total energy up from —109 to —79 eV (see Problem 5.10). 

The excited states of helium consist of one electron in the hydrogenic ground 
state and the other in an excited state: 


fnlm^ 100 - 


[ 5 . 32 ] 
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[If you try to put both electrons in excited states, one immediately drops to the ground 
state, releasing enough energy to knock the other one into the continuum (E > Oi. 
leaving you with a helium ion (He + ) and a free electron. This is an interesting sys¬ 
tem in its own right—see Problem 5.8—but it is not our present concern.] We can 
construct from this both symmetric and antisymmetric combinations, in the usual 
way (Equation 5.10); the former go with the antisymmetric spin configuration (the 
singlet), and they are called parahelium, while the latter require a symmetric spin 
configuration (the triplet), and they are known as orthohelium. The ground state 
is necessarily parahelium; the excited states come in both forms. Because the sym¬ 
metric spatial state brings the electrons closer together (as we discovered in Section 
5.1.2), we expect a higher interaction energy in parahelium, and indeed it is exper¬ 
imentally confirmed that the parahelium states have somewhat higher energy than 
their orthohelium counterparts (see Figure 5.2). 

Problem 5.8 

(a) Suppose you put both electrons in a helium atom into the n = 2 state; what 
would the energy of the emitted electron be? 

(b) Describe (quantitatively) the spectrum of the helium ion, He + . 

Problem 5.9 Discuss (qualitatively) the energy level scheme for helium (a) if elec¬ 
trons were identical bosons, and (b) if electrons were distinguishable particles (but 
still with the same mass and charge). Pretend the electrons still have spin 1 /2. 

**Problem 5.10 

(a) Calculate {(l/ln — r 2 |)) for the state (Equation 5.30). Hint Do the d 3 r : 
integral first, using spherical coordinates and setting the polar axis along iq, so 
that 

|ri - r 2 | = ^ + - 2r,r 2 cos0 2 . 

The 62 integral is easy, but be careful to take the positive root. You’ll have to 
break the r 2 integral into two pieces, one ranging from 0 to r \, the other from 
r\ to 00 . Answer. 5/4 a. 

(b) Use your result in (a) to estimate the electron interaction energy in the ground 
state of helium. Express your answer in electron volts, and add it to Eq (Equa¬ 
tion 5.31) to get a corrected estimate of the ground-state energy. Compare the 
experimental value. Note: Of course, we’re still working with an approximate 
wave function, so don’t expect perfect agreement. 
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Parahelium Orthohelium 



Figure 5.2: Energy level diagram for helium (the notation is explained in Section 
5.2.2). Note that parahelium energies are uniformly higher than their orthohelium 
counterparts. The numerical values on the vertical scale are relative to the ground 
state of ionized helium (He+): 4 x (-13.6 eV) = -54,4 eV; to get the total energy of 
the state, subtract 54.4 eV. 


5.2.2 The Periodic Table 

The ground-state electron configurations for heavier atoms can be pieced together 
in much the same way. To first approximation (ignoring their mutual repulsion al¬ 
together), the individual electrons occupy one-particle hydrogenic states (n, l, m), 
called orbitals, in the Coulomb potential of a nucleus with charge Ze. If electrons 
were bosons (or distinguishable particles), they would all shake down to the ground 
state (1,0,0), and chemistry would be very dull indeed. But electrons are in fact iden¬ 
tical fermions, subject to the Pauli exclusion principle, so only two can occupy any 
given orbital (one with spin up, and one with spin down—or, more precisely, in the 



Chap. 5 Identical Particles 


singlet configuration). There are n 2 hydrogenic wave functions (all with the same 
energy E n ) for a given value of n , so the n — 1 shell has room for two electrons, the 
n = 2 shell holds eight, n = 3 takes 18, and in general the nth shell can accomodate 
In 2 electrons. Qualitatively, the horizontal rows on the Periodic Table correspond 
to filling out each shell (if this were the whole story, they would have lengths 2. 
8, 18, 32, 50, etc., instead of 2, 8, 8, 18, 18, etc.; we’ll see in a moment how the 
electron-electron repulsion throws the counting off). 

With helium, the n = 1 shell is filled, so the next atom, lithium (Z = 3), has 
to put one electron into the n = 2 shell. Now, for n =2 we can have / = 0 or 
1=1; which of these will the third electron choose? In the absence of electron- 
electron interactions, they both have the same energy (the Bohr energies depend on 
n, remember, but not on /). But the effect of electron repulsion is to favor the lowest 
value of /, for the following reason: Angular momentum tends to throw the electron 
outward (more formally, the expectation value of r increases with increasing /, for a 
given n), and the farther out an electron gets, the more effectively the inner electron^ 
screen the nucleus (roughly speaking, the innermost electron “sees” the full nuclear 
charge Ze, but the outermost electron sees an effective charge hardly greater than e i. 
Within a given shell, therefore, the state with lowest energy (which is to say, the most 
tightly bound electron) is / = 0, and the energy increases with increasing l. Thus the 
third electron in lithium occupies the orbital (2,0,0). The next atom (beryllium, with 
Z = 4) also fits into this state (only with “opposite spin”), but boron (Z = 5) has to 
make use of / = 1. 

Continuing in this way, we reach neon (Z = 10), at which point the n = 2 shell 
is filled, and we advance to the next row of the periodic table and begin to populate the 
n = 3 shell. First there are two atoms (sodium and magnesium) with l = 0, and then 
there are six with / = 1 (aluminum through argon). Following argon there “should" 
be 10 atoms with n = 3 and 1 = 2; however, by this time the screening effect is so 
strong that it overlaps the next shell, so potassium (Z = 19) and calcium (Z = 20) 
choose n = 4, l = 0, in preference to n = 3, / = 2. After that we drop back to pick 
up the n = 3, / = 2 stragglers (scandium through zinc), followed by n =4,1 = 1 
(gallium through krypton), at which point we again make a premature jump to the 
next row (n = 5) and wait until later to slip in the 1 = 2 and 1 = 3 orbitals from the 
n = 4 shell. For details of this intricate counterpoint, refer to any book on atomic 
physics. 7 

I would be delinquent if I failed to mention the archaic nomenclature for atomic 
states, because all chemists and most physicists use it (and the people who make up the 
Graduate Record Exam love this kind of thing). For reasons known best to nineteenth- 
century spectroscopists, / = 0 is called s (for “sharp”), / = 1 is p (for “principal”). 
/ = 2 is d (“diffuse”), and / = 3 is / (“fundamental”); after that I guess they ran out of 


7 See, for example, U. Fano and L. Fano, Basic Physics of Atoms and Molecules (New York: John 
Wiley & Sons, 1959), Chapter 18, or the classic by G. Herzberg, Atomic Spectra and Atomic Structure 
(New York: Dover, 1944). 
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imagination, because the list just continues alphabetically ( g , h,i, etc.). 8 The state of 
a particular electron is represented by the pair nl, with n (the number) giving the shell 
and / (the letter) specifying the orbital angular momentum; the magnetic quantum 
number m is not listed, but an exponent is used to indicate the number of electrons 
that occupy the state in question. Thus the configuration 

(ls) 2 (2s) 2 (2p) 2 [5.33] 

tells us that there are two electrons in the orbital (1,0,0), two in the orbital (2,0,0), and 
two in some combination of the orbitals (2,1,1), (2,1,0), and (2,1,—1). This happens 
to be the ground state of carbon. 

In that example there are two electrons with orbital angular momentum quantum 
number 1, so the total orbital angular momentum quantum number L (capital L, 
instead of l, to indicate that this pertains to the total , not to any one particle) could be 
2, 1, or 0. Meanwhile, the two (Is) electrons are locked together in the singlet state, 
with total spin zero, and so are the two (2s) electrons, but the two (Ip) electrons could 
be in the singlet configuration or the triplet configuration. So the total spin quantum 
number S (capital, again, because it’s the total ) could be 1 or 0. Evidently the grand 
total (orbital plus spin) J could be 3, 2, 1, or 0. There exist rituals (Hund’s rules 9 ) 
for figuring out what these totals will be, for a particular atom. The result is recorded 
as the following hieroglyphic: 


25+1 Lj [5.34] 

(where S and J are the numbers, and L the letter—capitalized, this time, because 
we’re talking about the totals). The ground state of carbon happens to be 3 Po'. The 
total spin is 1 (hence the 3), the total orbital angular momentum is 1 (hence the P), and 
the grand total angular momentum is zero (hence the 0). In Table 5.1 the individual 
configurations and the total angular momenta (in the notation of Equation 5.34) are 
listed, for the first four rows of the Periodic Table. 

***Problem 5.11 

(a) Figure out the electron configurations (in the notation of Equation 5.33) for the 
first two rows of the Periodic Table (up to neon), and check your results against 
Table 5.1. 

(b) Figure out the corresponding total angular momenta, in the notation of Equation 
[5.34], for the first four elements. List all the possibilities for boron, carbon, 
and nitrogen. 


8 The shells themselves are assigned equally arbitrary nicknames, starting (don’t ask me why) with 
K: the K shell is n = 1, the L shell is n = 2, M is n = 3, and so on (at least they’re in alphabetical order). 

9 See, for example, Stephen Gasiorowicz, Quantum Physics (New York: John Wiley & Sons, 1974), 
Chapters 18 and 19. 
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Table 5.1: Ground-state electron configurations for the first four rows of the 
Periodic Table. 


z 

Element 

Configuration 


1 

H 

(Is) 

2 ft/2 

2 

He 

(Is) 2 

l S 0 

3 

Li 

(He) (25) 

2 ft/2 

4 

Be 

(He)(25) 2 

J ft 

5 

B 

(He)(25) 2 (2 p) 

2 P,/2 

6 

C 

(He)(2s) 2 (2/?) 2 

3 ft) 

7 

N 

(He)(25) 2 {2pj* 

4 ft/ 2 

8 

0 

(He)(25) 2 (2/7) 4 

3 Pi 

9 

F 

(He)(2s) 2 (2/?) 5 

2 ft/2 

10 

Ne 

(He)(2s) 2 (2/?) 6 

% 

11 

Na 

(Ne)(3s) 

2 Sl,2 

12 

Mg 

(Ne)(35) 2 

l So 

13 

A1 

(Ne)(35) 2 (3ft 

2 Plf2 

14 

Si 

(Ne)(35) 2 (3p) 2 

3 ft 

15 

P 

(Ne)(35) 2 (3/?) 3 

4 ft/ 2 

16 

S 

(Ne)(35) 2 (3/0 4 

2 Pi 

17 

Cl 

(Ne)(35) 2 (3/>) 5 

2 ft/2 

18 

Ar 

(Ne)(35) 2 (3/7) 6 

% 

19 

K 

(At)(4s) 

2 ft/2 

20 

Ca 

(Ar)(45) 2 

% 

21 

Sc 

(Ar)(45) 2 (3i) 

2 D 3/2 

22 

Ti 

(Ar)(4s) 2 (3ft) 2 

3 ft 

23 

V 

(Ar)(4s) 2 (3</) 3 

4 ft/2 

24 

Cr 

(Ar)(45)(3rf) 5 

7 ft 

25 

Mn 

(Ar)(4j) 2 (3d) 5 

6 ft/2 

26 

Fe 

(Ar)(4s) 2 (3d) 6 

5 £>4 

27 

Co 

(Ar)(45) 2 (3^) 7 

4 7*9/2 

28 

Ni 

(Ar)(4s) 2 (3rf) 8 

3 ft 

29 

Cu 

(Ar)(45)(3t/) 10 

2 ft/2 

30 

Zn 

(Ar)(45) 2 (3rf) 10 

‘ft 

31 

Ga 

(Ar)(4 j ) 2 (3d) 10 (4/j) 

2 ft/2 

32 

Ge 

(Ar)(45) 2 (3^) 10 (4/j) 2 

3 ft 

33 

As 

(Ar)(4j ) 2 (3d) 10 (4/?) 3 

4 ft/2 

34 

Se 

(Ar)(4 j ) 2 (3d) 10 (4/?) 4 

3 ft 

35 

Br 

(Ar)(45) 2 (3d) 10 (4/7) 5 

2 ft/2 

36 

Kr 

(Ar)(45) 2 (3d) 10 (4p) 6 

l So 


(c) Hund’s first rule says that, all other things being equal, the state with the highest 
total spin will have the lowest energy. What would this predict in the case of 
the excited states of helium? 
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(d) Hund’s second rule says that if a subshell ( n, l ) is no more than half filled, 
then the lowest energy level has J — \L — SI; if it is more than half filled, then 
J = L + S has the lowest energy. Use this to resolve the boron ambiguity in 

(b). 

(e) Use Hund’s rules and the fact that a symmetric spin state must go with an 
antisymmetric position state (and vice versa) to resolve the carbon ambiguity in 
(b). What can you say about nitrogen? 


Problem 5.12 The ground state of dysprosium (element 66, in the sixth row of the 
Periodic Table) is listed as 5 /g. What are the total spin, total orbital, and grand total 
angular momentum quantum numbers? Suggest a likely electron configuration for 
dysprosium. _ 


5.3 SOUPS _ 

In the solid state, a few of the loosely bound outermost valence electrons in each atom 
become detached and roam around throughout the material, no longer subject only to 
the Coulomb field of a specific “parent” nucleus, but rather to the combined potential 
of the entire crystal lattice. In this section we will examine two extremely primitive 
models: first, the electron gas theory of Sommerfeld, which ignores all forces (except 
the confining boundaries), treating the wandering electrons as free particles in a 
box (the three-dimensional analog to an infinite square well); and second, Bloch’s 
theory, which introduces a periodic potential representing the electrical attraction of 
the regularly spaced, positively charged, nuclei (but still ignores electron-electron 
repulsion). These models are no more than the first halting steps toward a quantum 
theory of solids, but already they reveal the critical role of the Pauli exclusion principle 
in accounting for the “solidity” of solids, and provide illuminating insight into the 
remarkable electrical properties of conductors, semiconductors, and insulators. 

5.3.1 The Free Electron Gas 

Suppose the object in question is a rectangular solid, with dimensions l x , l y , l z , and 
imagine that an electron inside experiences no forces at all, except at the impenetrable 
walls: 


V(x,y, z ) 


0, if (0 < jc < / x , 0 < y < l y , 0 < z < 4); 
oo, otherwise. 


[5.35] 


ti 2 

-^-VV = Ef, 

2m 


The Schrodinger equation, 
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separates in Cartesian coordinates: \jf{x,y,z) = X(x)Y(y)Z(z), with 

fi 2 d 2 X ft 2 d 2 Y h 2 d 2 Z 

2m dxi ~ ExX ' 2^dY _ EyY ' ~2~ E ‘ Z ’ 
and E = E x + E y + E z . Letting 



we obtain the general solutions 

X(x) = A x sin(* x .r) + B x cos (k x x), Y{y) = A y $m(k y y) + B y cos(k y y), 


Z(z) = A z sin (k z z) + B z cos(k z z). 


The boundary conditions require that X(0) = 7(0) = Z{0) = 0, so B x 
B z = 0, and X(l x ) = Y(l y ) = Z(l z ) = 0, so 

k x l x — n x n, k y ly = n y 7i;, k z l z = n z 7t, 
where each n is a positive integer: 

n x = 1, 2, 3, n y = 1, 2, 3, ..., n z = 1, 2, 3. 

The (normalized) wave functions are 

and the allowed energies are 



= By = 
[5.36: 

[5.37] 


[5.38] 



where k is the magnitude of the wave vector k = (k x ,k y , k z ). 

If you imagine a three-dimensional space, with axes k x , k y , k z , and planes drawn 
in at** = (n/l x ), (2tv //*), (3 n/l x ), ..., at k y = ( it/l y ), (2n/l y ), (3 n/l y ), ..., and 
at k z = {it j / z ), {lit/ / z ), (3it/l z ), , each intersection point represents a distinct 
(one-particle) stationary state (Figure 5.3). Each block in this grid, and hence also 
each state, occupies a volume 


7T 3 It 3 


[5.40] 
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Figure 5.3: Free electron gas. Each intersection on the grid represents an al¬ 
lowed energy. Shading indicates one block; there is one state for every block. 


of “A:-space;’ where V = l x l y h is the spatial volume of the object itself. Now 
suppose our sample contains N atoms, and each atom contributes q free electrons. 
(In practice, N will be enormous—on the order of Avogadro’s number, for an object 
of macroscopic size.) If electrons were bosons (or distinguishable particles), they 
would all settle down to the ground state, ^lii- 10 But electrons are in fact identical 
fermions subject to the Pauli exclusion principle, so only two of them can occupy any 
given state. They will fill up one octant of a sphere in A>space, u whose radius kp is 
determined by the fact that each pair of electrons requires a volume 7r 3 / V (Equation 
5.40): 


10 I’m assuming there is no appreciable thermal excitation, or other disturbance, to lift the solid out 
of its collective ground state. If you like, I’m talking about a “cold” solid, though (as you will show in 
Problem 5.13c), typical solids are still “cold,” in this sense, far above room temperature. 

11 Because N is such a huge number, we need not worry about the distinction between the actual 
jagged edge of the grid and the smooth spherical surface that approximates it. 
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Thus 


where 


(H-?(£)• 

k F = (3^7T 2 ) 1/3 , [5.41: 


N<7 






[5.42; 


is the free electron density (the number of free electrons per unit volume). 

The boundary separating occupied and unoccupied states, in £-space, is called 
the Fermi surface (hence the subscript F). The maximum occupied energy is called 
the Fermi energy £>; evidently, for a free electron gas, 


Ep = ^(3p?r 2 ) 2/3 . [5- 43 l 

The total energy of the electron gas can be calculated as follows: a shell of thickness 
dk (Figure 5.4) contains a volume 


i(4 wk 2 )dk, 

so the number of electron states in the shell is 

2[(l/2 )nk 2 dk] V ,j „ 

OtyV) ~ ^ 



Figure 5.4: One octant of a spherical shell in £-space. 
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Each of these states carries an energy h 2 k 2 /2m (Equation 5.39), so the energy of the 
shell is 

Pi 2 k 2 V 

dE =-^-—k 2 dK [5.44] 

2m 7T 2 

and hence the total energy is 


= Jv_ C" 4 fiV F V = h 2 (3n 2 Ngf/ 3 ( ,_ 2/3 . 
2 7 Z 2 m J 0 10 7i 2 m I0n 2 m 


[5.45] 


This quantum mechanical energy plays a role rather analogous to the internal 
thermal energy ( U ) of an ordinary gas. In particular, it exerts a pressure on the walls, 
for if the box expands by an amount dV, the total energy decreases: 


dE lo 


3 10jt 2 w 


2 dV 
3^ tot V ' 


and this shows up as work done on the outside (dW = PdV) by the quantum pressure 
P. Evidently 


/> = ? *= 
3 V 


2 h 2 k 5 F (3 it 2 ?/^ 2 


[5.46] 


3 107T 2 m 5m 

Here, then, is a partial answer to the question of why a cold solid object doesn’t 
simply collapse : There is a stabilizing internal pressure that has nothing to do with 
electron-electron repulsion (which we have ignored) or thermal motion (which we 
have excluded) but is strictly quantum mechanical, and derives ultimately from the 
antisymmetrization requirement for the wave functions of identical fermions. It is 
sometimes called degeneracy pressure, although “exclusion pressure” might be a 
better term. 12 


Problem 5.13 The density of copper is 8.96 gm/cm 3 , and its atomic weight is 63.5 
gm/mole. 


(a) Calculate the Fermi energy for copper (Equation 5.43). Assume q.~ 1, and 
give your answer in electron volts. 

(b) What is the corresponding electron velocity [set E F = (1/2)mu 2 ]? Is it safe to 
assume that the electrons in copper are nonrelativistic? 

(c) At what temperature would the characteristic thermal energy (k B T, where k B is 
the Boltzmann constant and T is the Kelvin temperature) equal the Fermi energy, 
for copper? Note: This is called the Fermi temperature. As long as the actual 
temperature is substantially below the Fermi temperature, the material can be 
regarded as “cold”, with most of the electrons in the ground-state configuration. 
Since the melting point of copper is 1356 K, solid copper is always cold. 


12 We derived Equations 5.41, 5.43, 5.45, and 5.46 for the special case of an infinite rectangular 
well; but they hold for containers of any shape as long as the number of particles is extremely large. 
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(d) Calculate the degeneracy pressure (Equation 5.46) of copper, in the electron gas 
model. 


Problem 5.14 The bulk modulus of a substance is the ratio of a small decrease 
in pressure to the resulting fractional increase in volume: 


Show that B = (5/3) P, in the free electron gas model, and use your result in Prob¬ 
lem 5.13(d) to estimate the bulk modulus of copper. Note : The observed value is 
13.4 x 10 10 N/m 2 , but don’t expect perfect agreement—after all, we’re neglecting 
all electron-nucleus and electron-electron forces! Actually, it is rather surprising that 
this calculation comes as close as it does. 


5.3.2 Band Structure 

We’re now going to improve on the free electron model by including the forces exerted 
on the electrons by the regularly spaced, positively charged, essentially stationary 
nuclei. The qualitative behavior of solids is dictated to a remarkable degree by 
the mere fact that this potential is periodic —its actual shape is relevant only to the 
finer details. To show you how it goes, I’m going to develop the simplest possible 
example: a one-dimensional Dirac comb, consisting of evenly spaced delta-function 
wells (Figure 5.5). But before I get to that, we need to know a bit about the general 
theory of periodic potentials. 

Consider, then, a single particle subject to a periodic potential in one dimension: 

V{x+a) = V(x). [5.47] 

Bloch’s theorem tells us that the solutions to the Schrodinger equation, 

h 2 d 2 \jf 

~ 2 + = [5 ' 481 



Figure 5.5: The Dirac comb, Equation 5.57. 
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for such a potential, can be taken to satisfy the condition 

i j/(x + a) — e iKa is(x) [5.49] 

for some constant K. 

Proof: Let D be the “displacement” operator: 

Df{x) = f(x+a). [5.50] 

By virtue of Equation 5.47, D commutes with the Hamiltonian: 

[£>,//] =0, [5.51] 

and hence (see Section 3.4.1) we are free to choose eigenfunctions of H that 
are simultaneously eigenfunctions of D: D\f — Xfi, or 

f/(x + a) — X\Jf(x). [5.52] 

Now X is certainly not zero [if it were, then—since Equation 5.52 holds for 
all x —we would immediately obtain f{x) = 0, which is not a permissible 
eigenfunction], so, like any nonzero complex number, it can be expressed as an 
exponential: 

X = e iKa [5.53] 

for some constant K. QED. 

At this stage Equation 5.53 is merely a strange way to write the eigenvalue X, 
but in a moment we will discover that K is in fact real, so that although \jf{x) itself 
is not periodic, |t/r(x)| 2 is: 

\f{x + a)\ 2 — \fr{x)\ 2 , [5.54] 

as one would certainly expect. 13 

Of course, no real solid goes on forever, and the edges are going to spoil the 
periodicity of V(x) and render Bloch’s theorem inapplicable. However, for any 
macroscopic crystal, containing something on the order of Avogadro’s number of 
atoms, it is hardly imaginable that edge effects can significantly influence the behav¬ 
ior of electrons deep inside. This suggests the following device to salvage Bloch’s 
theorem: We wrap the x-axis around in a circle and connect it onto its tail, after a 
large number N ^ 10 23 of periods; formally, we impose the boundary condition 

yj/{x + Na) = fix). [5.55] 


13 Indeed, you might be tempted to reverse the argument, starting with Equation 5.54, as a way 
of proving Bloch’s theorem. It doesn’t work, for Equation 5.54 alone would allow the phase factor in 
Equation 5.49 to be a function of x. 
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It follows (from Equation 5.49) that 


e iNKa f(x) = f(x). 


so e lNKa = 1, or NKa = 2nn, or 


K 


2jtn 

~Na' 


(n = 0, ±1, ±2, ...). 


[5.56] 


In particular, for this arrangement K is necessarily real. The virtue of Bloch’s theorem 
is that we need only solve the Schrodinger equation within a single cell (say, on the 
interval 0 < x < a); recursive application of Equation 5.49 generates the solution 
everywhere else. 

Now suppose the potential consists of a long string of delta-function wells (the 
Dirac comb): 

N -1 

V(x) = -aJ2*(x-ja). [5.57] 

j=0 


The wells are supposed to represent, very crudely, the electrical attraction of the nuclei 
in the lattice. (In Figure 5.5 you must imagine that the x-axis has been “wrapped 
around,” as suggested in the previous paragraph, so the Nth well actually appears at 
x = —a.) No one would pretend that this is a realistic model, but remember, it is only 
the effect of periodicity that concerns us here; the classic study 14 used a repeating 
rectangular pattern, and many authors still prefer that one. 15 In the region 0 < x < a 
the potential is zero, so 


h z d 2 f 
2mlh? 


= E\j/, 


or 


d 2 f 

dx 2 


— —k 2 \fr. 


where 


k = 



[5.58] 


as usual. (I’ll work out the positive-energy solutions; the negative-energy states can 
be obtained in exactly the same way, using k = \^—2mE/Ti, or by simply substituting 
k Ik in the final result [5.64].) 

The general solution is 


i/r(x) = Asin(fcx) + Z?cos(A:x), (0 < x < a). [5.59] 


14 R. de L. Kronig and W. G. Penney, Proc. R. Soc. Lond., ser. A, 130, 499 (1930). 

15 See, for instance, D. Park, Introduction to the Quantum Theory , 3rd ed,, (New York: McGraw-Hill, 

1992). 
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According to Bloch’s theorem, the wave function in the cell immediately to the left 
of the origin is 

= e~ lKa [A sin£(x + a) + Bcosk(x + a)], (—a < x < 0). [5.60] 

At v — 0, \}r must be continuous, so 

B — e~‘ Ka [A sin(Ara) + 2?cos(£a)]; [5.61] 

its derivative suffers a discontinuity proportional to the strength of the delta function 
(see Equation 2.107): 

kA — e~~ lKa k[A cos (ka) — B sin(Ara)] = —[5.62] 

Solving Equation 5.61 for A sin(Ara) yields 

A sin(Ara) = [ e ,Ka — cos(£a)]5. [5.63] 

Substituting this into Equation 5.62, and canceling kB , we find 

\e iKa - cos(A:a)][l - e~ iKa cos(£a)] + e~ lKa sin 2 (A:a) = sin {ka), 

n k 

which simplifies to 

cos (Ka) = cos (ka) -r- sin(Aa). [5.64] 

hk 

This is the fundamental result from which all else follows. For the Kronig-Penney po¬ 
tential (see footnote 15), the formula is more complicated, but it shares the qualitative 
features we are about to explore. 

Equation 5.64 determines the possible values of k, and hence the allowed en¬ 
ergies. To simplify the notation, let 

moia 

z = ka , and /J - —y-, [5.65] 

fi 

so the right side of Equation 5.64 can be written 

/(z) = cos(z) — {3 ^ ^ \ [5.66] 

The constant fi is a dimensionless measure of the “strength” of the delta function. In 
Figure 5.61 have plotted f(z) for the case fi = 1. The important thing to notice is that 
/(z) strays outside the range (-1,4-1), and in such regions there is no hope of solving 
Equation 5.64, since | cos(Ka)\ < 1. These gaps represent forbidden energies; they 
are separated by bands of allowed energies. Within a given band, virtually any energy 
is allowed, for according to Equation 5.56 Ka = 2jtn/N, where A is a huge number, 
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band band band band 

Figure 5.6: Graph of f(z) (Equation 5.66) for p = 1, showing allowed bands 
separated by forbidden gaps. 


and n can be any integer. You might imagine drawing N horizontal lines on Figure 
5.6, at values of cos(27rn /N) ranging from +1 (n = 0) down to — 1 (n = N /2), anc 
back almost to +1 (n — N — 1)—at this point the Bloch factor e l Ka recycles, so no new 
solutions are generated by further increasing n . The intersection of each of these line', 
with f(z) yields an allowed energy. Evidently there are N/2 positive-energy state" 
in the first band (joined by Njl negative-energy states) and N in all the higher bands, 
they are so closely spaced that for most purposes we can regard them as forming a 
continuum (Figure 5.7). 

So far, we’ve only put one electron in our potential. In practice there will be 
Nq of them, where q is again the number of “free” electrons per atom. Because ot 





band 


gap 


band 


gap 

band 


Figure 5.7: The allowed positive 
energies for a periodic potential. 
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the Pauli exclusion principle, only two electrons can occupy a given spatial state, 
so if q = 1, they will just fill the negative-energy half of the first band, if q — 2 
they will completely fill the first band, if q = 3 they half fill the second band, and 
so on. (In three dimensions, and with more realistic potentials, the band structure 
may be more complicated, but the existence of bands, separated by forbidden gaps, 
persists—band structure is the signature of a periodic potential.) Now, if a band is 
entirely filled, it takes a relatively large energy to excite an electron, since it has to 
jump across the forbidden zone. Such materials will be electrical insulators. On 
the other hand, if a band is only partly filled, it takes very little energy to excite an 
electron, and such materials are typically conductors. If you dope an insulator with 
a few atoms of larger or smaller q, this puts some “extra” electrons into the next 
higher band, or creates some holes in the previously filled one, allowing in either 
case for weak electric currents to flow; such materials are called semiconductors. In 
the free electron model all solids should be excellent conductors, since there are no 
large gaps in the spectrum of allowed energies. It takes the band theory to account 
for the extraordinary range of electrical conductivities exhibited by the solids in 
nature. 


Problem 5.15 

(a) Using Equations 5.59 and 5.63, show that the wave function for a particle in the 
periodic delta function potential can be written in the form 

fix) = C[sin(£x) + e~ iKa sin k(a - x)], (0 < x < a). 

(Don’t bother to determine the normalization constant C.) 

(b) There is an exception: At the bottom of a band, where ka = Ka = jn , (a) 
yields f(x) = 0. Find the correct wave function for this case. Note what 
happens to f at each delta function. 

Problem 5.16 Find the energy at the top of the first allowed band, for the case 
f} = 5, correct to three significant digits. For the sake of argument, assume a/a — 1 
eV. 

Problem 5.17 Suppose we used delta-function spikes , instead of wells (so that 
the electrons are repelled , instead of attracted, by the nuclei). Draw the analogs to 
Figures 5.6 and 5.7 (using the same values of the parameters—except for their signs). 
How many allowed energies are there in each band? What is the energy at the top of 
the jth band? 

Problem 5.18 Show that most of the energies determined by Equation 5.64 are dou¬ 
bly degenerate. What are the exceptional cases? Hint: Try it for A — 1, 2, 3, 4, ..., 
to see how it goes. What are the possible values of cos(^a) in each case? 
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5.4 QUANTUM STATISTICAL MECHANICS 


At absolute zero, a physical system occupies its lowest energy configuration. As v> r 
turn up the temperature, random thermal activity will begin to populate the excitec 
states, and this raises the following question: If we have a large number N of particle' 
in thermal equilibrium at temperature T, what is the probability that a given particle 
would be found to have a specific energy Efi Note that the “probability” in questior 
has nothing to do with quantum indeterminacy—exactly the same question arises ir 
classical statistical mechanics. The reason we must be content with a probabilistic 
answer is that we are typically dealing with enormous numbers of particles, and w c 
could not possibly expect to keep track of each one separately, whether or not the 
underlying mechanics is deterministic. 

The fundamental assumption of statistical mechanics is that in thermal 
equilibrium every distinct state with the same total energy E is equally probable 
Random thermal motions constantly shift energy from one particle to another anc 
from one form (rotational, kinetic, vibrational, etc.) to another, but (absent externa, 
influences) the total is fixed by conservation of energy. The assumption (and it's - 
deep one, worth thinking about) is that this continual redistribution of energy doe-' 
not favor any particular state. The temperature T is simply a measure of the total 
energy, for a system in thermal equilibrium. The only new twist introduced b> 
quantum mechanics has to do with how we count the distinct states , and this depend' 
critically on whether the particles involved are distinguishable, identical bosons, or 
identical fermions. The arguments are relatively straightforward, but the arithmetic 
gets pretty dense, so I’m going to begin with an absurdly simple example, so you‘1. 
have a clear sense of what is at issue when we come to the general case. 

5.4.1 Example 

Suppose we have just three noninteracting particles (all of mass m) in the one¬ 
dimensional infinite square well (Section 2.2). The total energy is 

7t 2 h 2 

E = E A +E B + E c =-^—^{n^+nl+nl) [5.67; 

(see Equation 2.23), where n A , n B , and «c are positive integers. Now suppose, for 
the sake of argument, that E — 243(7r 2 /z 2 /2ma 2 ), which is to say, 

n\ +n 2 B +n 2 c =243. [5.68; 

There are, as it happens, 10 combinations of positive integers, the sum of whose 
squares is 243: All three could be 9, or two could be 3 and one 15 (which occurs in 
three permutations), or one could be 5, one 7, and one 13 (six permutations). Thu' 
(n A , n B , nc) is one of the following: 
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(9,9,9), 

(3,3, 15), (3, 15,3), (15,3,3), 

(5,7,13), (5,13,7), (7,5,13), (7,13,5), (13,5,7), (13,7,5). 

If the particles are distinguishable, each of these represents a distinct quantum 
state, and the fundamental assumption of statistical mechanics says that in thermal 
equilibrium 16 they are all equally likely. But I’m not interested in knowing which 
particle is in which (one-particle) state, only the total number of particles in each 
state — the occupation number N n for the state i fs n . The collection of all occupation 
numbers for a given three-particle state we will call the configuration. If all three 
are in i f/ 9 , the configuration is 

(0, 0, 0, 0, 0, 0, 0, 0, 3, 0, 0, 0, 0, 0, 0, 0, 0, 0,...) [5.69] 

(i.e., Nq = 3, all others zero). If two are in xjr 3 and one is in 1 /^ 5 , the configuration is 

(0, 0, 2 , 0,0, 0, 0,0, 0, 0, 0,0, 0, 0, 1 , 0, 0,...) [5.70] 

(i.e., Nt, = 2, N\$ = 1, all others zero). And if there is one particle in 1 ^ 5 , one in xj/q, 
and one in t/r 13? the configuration is 

(0, 0, 0, 0, 1,0, 1, 0, 0, 0, 0, 0, 1, 0, 0, 0, ...) [5.71] 

(i.e., N$ — Nj = iVj 3 = 1, all others zero.) Of these, the third is the most probable 
configuration, because it can be achieved in six different ways, whereas the second 
occurs three ways, and the first only one. 

Returning now to my original question, if we select one of these three particles 
at random, what is the probability ( P n ) of getting a specific (allowed) energy E n l 
Well, the chances are 1 in 10 that the system is in the first configuration (Equation 
5.69), and in that event we are certain to get E 9 , so P 9 = 1/10. The chances are 
3 in 10 that the system is in the second configuration (Equation 5.70), and in that 
case there is a 2/3 probability of getting £ 3 , and 1/3 probability of getting E 15 , so 
P 3 = (3/10) x (2/3) = 1/5, and P 15 = (3/10) x (1/3) = 1/10. And the chances 
are 6 in 10 that the system is in the third configuration, in which case the probability is 
1/3 each that we’ll get E 5 , E-j, and Eu, so P 5 = P 1 — P l3 = (6/10) x (1 /3) = 1 /5. 
As a check, we note that 


l6 How the particles maintain thermal equilibrium, if they really don’t interact at all, is a problem I’d 
rather not worry about—maybe God reaches in periodically and stirs things up (being careful not to add 
or remove any energy). In real life, of course, the continual redistribution of energy is caused precisely 
by interactions between the particles, so if you don’t approve of divine intervention let there be extremely 
weak interactions—sufficient to thermalize the system (at least, over long time periods) but too small to 
alter the stationary states and the allowed energies appreciably. 
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P 3 + P 5 + P 7 + P 9 + P u + P 15 


: 5 + 5' 


1 1 1 

- - -|- — -|- - — 1 . 

10 5 10 


But that’s when the particles are distinguishable. If in fact they are identica. 
fermions, the antisymmetrization requirement (leaving aside spin, for simplicity) ex¬ 
cludes the first and second configurations (which assign two—or, worse still, three- 
particles into the same state), and there is just one state with the third configuration 
(see Problem 5.19). For identical fermions, then, P 5 = Pi = P ]3 — 1/3 (and again 
the sum of the probabilities is 1). On the other hand, if they are identical bosons the 
symmetrization requirement allows for one state with each configuration (see Prob¬ 
lem 5.19), so P 9 = 1/3, P 3 = (1/3) X (2/3) = 2/9, P ]5 = (1/3) x (1/3) - 1/9. 
and P 5 = P 7 — P \ 3 = (1/3) x (1/3) = 1/9. As always, the sum is 1. 

The purpose of this example was to show you how the counting of states depend' 
on the nature of the particles. In one respect it was actually more complicated than 
the realistic situation, in which N is a huge number. For as N grows, the most 
probable configuration (in this example, N$ = N 7 = N ]3 = 1, for the case ot 
distinguishable particles) becomes overwhelmingly more likely than its competitors, 
so that, for statistical purposes, we can afford to ignore the others altogether. Tin 
distribution of individual particle energies, at equilibrium, is simply their distribution 
in the most probable configuration. (If this were true for N ~ 3—which, obviously, it 
is not— we would conclude that P 5 = P 1 = P ]3 = 1/3 for the case of distinguishable 
particles.) I’ll return to this point in Section 5.4.3, but first we need to generalize the 
counting procedure itself. 

^Problem 5.19 


(a) Construct the completely antisymmetric wave function xjr(x A ,x B ,xc) for three 
identical fermions, one in the state i> 5 - one in the state \jr 1 , and one in the state 

^ 13 - 

(b) Construct the completely symmetric wave function \jf(x A , x B , xc) for three iden¬ 
tical bosons, (i) if all three are in state xj/g, (ii) if two are in state xf / 3 and one 
is in state xp- ]3 , and (iii) if one is in state xjf$, one in state xj/q, and one in state 
fn- 

^Problem 5.20 Suppose you had three particles in a one-dimensional harmonic 
oscillator potential, in thermal equilibrium, with a total energy E = (9/2)hco. 

(a) If they are distinguishable particles (but all with the same mass), what are the pos¬ 
sible occupation-number configurations, and how many distinct (three-particle) 
states are there for each one? What is the most probable configuration? If you 
picked a particle at random and measured its energy, what values might you get. 
and what is the probability of each one? What is the most probable energy? 
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(b) Do the same for the case of identical fermions (ignoring spin, as in the Example 
in Section 5.4.1). 

(c) Do the same for the case of identical bosons (ignoring spin). 


5.4.2 The General Case 

Now consider an arbitrary potential, for which the one-particle energies are E\, E 2 . 
£ 3 ,... , with degeneracies d \, d 2 , d 3 , ... (i.e., there are d n distinct one-particle states 
with the same energy E n ). Suppose we put N particles (all with the same mass) into 
this potential; we are interested in the configuration ( N\ , N 2 , N 3 ,...), for which there 
are /V, particles with energy £), N 2 particles with energy E 2 , and so on. Question: 
How many different ways can this be achieved (or, more precisely, how many distinct 
states correspond to this particular configuration)? The answer, Q(N\, N 2 , N 3 , ...), 
depends on whether the particles are distinguishable, identical fermions, or identical 
bosons, so we’ll treat the three cases separately. 17 

First, assume the particles are distinguishable. How many ways are there to 
select (from the N available candidates) the A 7 , to be placed in the first “bin”? Answer: 
the binomial coefficient, “N choose N\”: 


( 

For there are N ways to pick the first particle, leaving (N — 1) for the second, and so 
on: 

N(N - l)(N - 2) • • • (JV - JV| + 1) = —. 

(N - A/))! 

However, this counts separately the N \! different permutations of the N\ particles, 
whereas we don’t care whether number 37 was picked on the first draw, or on the 
twenty-ninth draw; so we divide by N)!, confirming Equation 5.72. Now, how many 
different ways can those Ni particles be arranged within the first bin? Well, there 
are d\ states in the bin, so each particle has d\ choices; evidently there are (dQ Nl 
possibilities in all. Thus the number of ways to put A) particles, selected from a total 
population of N, into a bin containing d\ distinct options, is 

N\d 

N\\(N — N\)\' 

The same goes for bin 2, of course, except that there are now only (N — NQ particles 
left to work with: 

N 2 \(N-Ni - A? 2 )!’ 

l7 The presentation here follows closely that of Amnon Yariv, An Introduction to Theory and Appli¬ 
cations of Quantum Mechanics (New York: John Wiley & Sons, 1982). 


- 


N\ 


N\\(N — N\)V 


[5.72] 
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and so on. It follows that 


Q(N\, N 2 , N 3 , ...) 

N\df' (N - N,)\d? (N-N,- N 2 )\d^ 

N 2 !(JV - Ni - N 2 )\N 3 \(N - N, - N 2 - Mi)'" ' 


= = m fif 


'Ni'.N 2 \N 3 \■ 


, N n l 


[5.73 


(You should pause right now and check this result for the Example in Section 5.4.1 — 
see Problem 5.21.) 

The problem is a lot easier for identical fermions. Because they are indistir. 
guishable, it doesn’t matter which particles are in which states—the antisymmetry 
tion requirement means that there is just one /V-particle state in which a specific se: 
of one-particle states is occupied. Moreover, only one particle can occupy any gi\e' 
state. There are 

(i) 

ways to choose the N n occupied states in the nth bin, 18 so 


OQ 

G(isr 1 , J Y 2 ,^3,...) = n 


d n \ 

Nn\{d n -N n )\ 


[5.7-1 


(Check it for the Example in Section 5.4.1—see Problem 5.21.) 

The calculation is hardest for the case of identical bosons. Again, the sym- 
metrization requirement means that there is just one /V-particle state in which a spe¬ 
cific set of one-particle states is occupied, but this time there is no restriction on the 
number of particles that can share the same one-particle state. For the nth bin, the 
question becomes: how many different ways can we assign N„ identical particle' 
to d n different slots? There are many ways to solve this combinatorial problem: an 
especially clever method involves the following trick: Let dots represent particles anc 
crosses represent partitions, so that, for example, if d n =5 and N n = 7, 


• •x*x**»x*x 

would indicate that there are two particles in the first state, one in the second, three 
in the third, one in the fourth, and none in the fifth. Note that there are N n dots and 
(d n - 1) crosses (partitioning the dots into d„ groups). If the individual dots and 
crosses were labeled , there would be(N„+d„-\) \ different ways to arrange them 
But for our purposes the dots are all equivalent—permuting them ( N n ! ways) does not 


18 This should be zero, of course, if N„ > d n , and it is, provided that we consider the factorial of - 
negative integer to be infinite. 
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change the state. Likewise, the crosses are all equivalent—permuting them [(d„ — 1 )! 
ways] changes nothing. So there are in fact 


(Nn+d a - 1 )! _ (N n + d n - 1\ 
N„\(d„ — 1 )! \ N„ ) 


[5.75] 


distinct ways of assigning the N n particles to the d n one-particle states in the nth bin, 
and we conclude that 


CO 

0(Ar,,Af 2 ,iV3,...) = n 


(N„ +d n - 1 )! 
N n \(d n - 1 )! 


[5.76] 


(Check it for the Example in Section 5.4.1—see Problem 5.21.) 


^Problem 5.21 Check Equations 5.73, 5.74, and 5.76 for the Example in Section 
5.4.1. 


**Problem 5.22 Obtain Equation 5.75 by induction. The combinatorial question is 
this: How many different ways can you put N identical balls into d baskets (never 
mind the subscript n for this problem). You could stick all N of them into the third 
basket, or all but one in the second basket and one in the fifth, or two in the first and 
three in the third and all the rest in the seventh, etc. Work it out explicitly for the 
cases N = l, N = 2, N = 3, and N = 4; by that stage you should be able to deduce 
the general formula. 


5.4.3 The Most Probable Configuration 

In thermal equilibrium, every state with a given total energy E and a given particle 
number N is equally likely. So the most probable configuration (Afi, /VS, /V 3 , ...) 
is the one that can be achieved in the largest number of different ways—it is that 
particular configuration for which Q(N U N 2 , N 3 , .. .) is a maximum, subject to the 
constraints 

X>» = iV [5.77] 

n = \ 

and 

J^N n E n =E. [5.78] 

n=1 

The problem of maximizing a function F(x \, x 2 , x 3 ,...) of several variables, subject 
to the constraints f\ (X], x 2 , x 3 , ...) = 0, f 2 {x { , jc 2 , jc 3 ,...) = 0, etc., is most con¬ 
veniently handled by the method of Lagrange multipliers 19 : We introduce the new 
function 


19 See, for example, Mary Boas, Mathematical Methods in the Physical Sciences, 2nd ed. (New 
York: John Wiley & Sons, 1983), Chapter 4, Section 9. 
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G(Xj, X2, Xy, . . . , X 2, • • 

: ) = F + x t f l +x 2 f 2 + --- 

[5.79 

and set all its derivatives equal to zero: 

3 G 

3G n 

[5.80 

— = 0; 

-=0. 

dx n 

dk„ 


In our case it’s a little easier to work with the logarithm of Q, instead of O 
itself—this turns the products into sums. Since the logarithm is a monotonic function 
of its argument, the maxima of Q and ln(0 occur at the same point. So we let 


G = ln(0 + a, 


00 

AT-£>„ 


+p 


00 

E-Y,NnE„ 


[5.81 


where a and /? are the Lagrange multipliers. Setting the derivatives with respect to a 
and fi equal to zero merely reproduces the constraints (Equations 5.77 and 5.78): r. 
remains, then, to set the derivative with respect to N„ equal to zero. 

If the particles are distinguishable , then Q is given by Equation 5.73, and we 

have 

00 

G = ln(N\) + ln ^") “ ln W)] 


OG 


OG 

n = 1 

+ P 

£-X>„£„ 

n = l 


[5.8: 


Assuming that the relevant occupation numbers (N n ) are large, we can invoke Stir¬ 
ling’s approximation 20 : 


ln(z!) as zln(z) - z for z >> 1 


[5.83 


to write 


G ^ X! [Nn ln ( dn) - Nn ln(Nn) + Nn ~ aNn - p e » n »] 

+ \n(N\)+aN + PE. [5.84; 


It follows that 


3G 

W n 


= \xt(d n ) — ln(N») — a — ft E n . 


[5.85] 


20 See George Arfken, Mathematical Methods for Physicists , 3rd ed. (Orlando, FL: Academic 
Press, 1985), Section 10.3. If the relevant occupation numbers are not large—as in the Example of Sectior 
5.4,1—then statistical mechanics simply doesn’t apply. The whole point is to deal with such enormou' 
numbers that statistical inference is a reliable predictor. Of course, there will always be one-particle state' 
of extremely high energy that are not populated at all\ fortunately, Stirling’s approximation holds also for 
z = 0. I use the word “relevant” to exclude any stray states right at the margin, for which N n is neither 
huge nor zero. 
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Setting this equal to zero and solving for N„, we conclude that the most probable 
occupation numbers for distinguishable particles are 

N n = d n e- (a+(iEn) . [5.86] 

If the particles are identical fermions, then Q is given by Equation 5.74, and 
we have 

00 

g = Y1 ( ln W) - ln(A/J) - ln [(<4 - A«)!]} 


+ ct 


00 


+p 


E ~Y^N n E n 


[5.87] 


This time we must assume not only that N n is large, but also that d n > N n f so that 
Stirling’s approximation applies to both terms. In that case 


G % ]T[ln(<4 !) “ Nn ln(A " n) + N «- ( d n - N„) In (d n - N„) 


+ (d n - N n ) - aN„ - fE n N n J + aN + fE, [5.88] 

so 

= - \n(N n ) + \n(d n - N n ) - a - fE n . [5.89] 

oN„ 

Setting this equal to zero and solving for N„, we find the most probable occupation 
numbers for identical fermions: 


N n 


d n 

e (a+PE n ) + i ■ 


[5.90] 


Finally, if the particles are identical bosons, then Q is given by Equation 5.76, 
and we have 


00 

g = J2 {ln[(A ^ + dn - 1)!] - In (An!) - ln[(c4 - 1)!]} 


+ a 


N-±N r , 


+ 




[5.91] 


21 In one dimension the energies are nondegenerate (see Problem 2.42), but in three dimensions d n 
typically increases rapidly with increasing n (for example, in the case of hydrogen, d» = n 2 ). So it is not 
unreasonable to assume that for most of the occupied states d n i. On the other hand, d n is certainly not 
much greater than N n at absolute zero, where all states up to the Fermi level are filled, and hence d n = N n . 
Here again we are rescued by the fact that Stirling’s formula holds also for z = 0. 
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Assuming (as always) that N n » 1, and using Stirling’s approximation: 

oo 

G kY^\(N„ + d n - \) ln(iV„ + d n - 1) - (N n + d„ - 1) - N n ln(N„) 


+ N„- ln[(rf„ - 1)!] - aN„ - 0E„N„} + aN + fiE, [5.92; 


SO 


3 G 
dN~„ 


= In +d n - 1) - In (A„) -a-pE r 


[5.93; 


Setting this equal to zero and solving for A„, we find the most probable occupation 
numbers for identical bosons: 


A fl 


d n - 1 


g(ot+ftE„) — J 


[5.94: 


(For consistency with the approximations already invoked, we should really drop the 
1 in the numerator, and I shall do so from now on.) 

Problem 5.23 Use the method of Lagrange multipliers to find the area of the 
largest rectangle, with sides parallel to the axes, that can be inscribed in the ellipse 
C x/a ) 2 + (y/b) 1 = 1. 


Problem 5.24 

(a) Find the percent error in Stirling’s approximation for z = 10. 

(b) What is the smallest integer z such that the error is less than 1%? 


5.4.4 Physical Significance of ot and ft 

The parameters a and came into the story as Lagrange multipliers, associated 
with the total number of particles and the total energy, respectively. Mathematically, 
they are determined by substituting the occupation numbers (Equations 5.86, 5.90. 
and 5.94) back into the constraint equations (Equations 5.77 and 5.78). To earn 
out the summation, however, we need to know the allowed energies (E n ) and their 
degeneracies ( d n ) for the potential in question. As an example, I’ll work out the case 
of a three-dimensional infinite square well; this will enable us to infer the physical 
significance of a and fi. 

In Section 5.3.1 we determined the the allowed energies (Equation 5.39): 

E t = [5.95] 


where 
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7in x nny nn z \ 

~T7' ~w 


As before, we convert the sum into an integral, treating k as a continuous variable, 
with one state (or, for spin s, 2s + 1 states) per volume it 3 /V of /.'-space. Taking as 
our “bins” the spherical shells in the first octant (see Figure 5.4), the “degeneracy” 
(that is, the number of states in the bin) is 


d k 


1 4nk 2 dk 
8 (* 3 IV) 


2n 2 


k l dk. 


[5.96] 


For distinguishable particles (Equation 5.86), the first constraint (Equation 5.77) be¬ 
comes 


N = 


2 7T 2 


jf 


e -P* 2 k 2 l2m k 2 dk= y e -c 



SO 


N ( 2 tt^ 2 \ 3/2 
V\ m ) 


The second constraint (Equation 5.78) says 

2 roc 


r = Jl e -»l r 

2tt 2 2m J 0 


e -f>hVl2n, k A ^ = 

2/3 


or, putting in Equation 5.97 for e 01 , 



[5.97] 


E 


3 N 
2$' 


[5.98] 


(If you include the spin factor, 2^ + 1, in Equation 5.96, it cancels out at this point, 
so Equation 5.98 is correct regardless of spin.) 

Equation 5.98 is reminiscent of the classical formula for the average kinetic 
energy of an atom at temperature T 22 : 


E 

~N 


h T ’ 


[5.99] 


where k B is the Boltzmann constant. This suggests that is related to the temperature: 


k B T 


[5.100] 


22 See, for example, David Halliday and Robert Resnick, Fundamentals of Physics. 3rd ed. extended 
(New York; John Wiley & Sons, 1988), Section 21-5. 
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To prove that this holds in general, and not simply for distinguishable particles in lit 
three-dimensional infinite square well, we would have to demonstrate that differed 
substances in thermal equilibrium with one another have the same value of ft. Th? 
argument is sketched in many books, 23 but I shall not reproduce it here—I will simp 1 * 
adopt equation [5.100] as the definition of T. 

It is customary to replace a (which, as is clear from the special case of Equatior 
5.97, is a function of T) by the so-called chemical potential, 

pi(T) = -ak B T, [5.101 

and rewrite Equations 5.86, 5.90, and 5.94 as formulas for the most probable numbt r 
of particles in a particular (one-particle) state with energy € (to go from the numbe* 
of particles with a given energy to the number of particles in a particular state witr 
that energy, we simply divide by the degeneracy of the state): 

t e u- l , )/ k il i- MAXWELL-BOLTZMANN 


n(c) = j e ie-fi)/k B T _|_ 1 ’ 


FERMI-DIRAC 


BOSE-EINSTEIN 


The Maxwell-Boltzmann distribution is the classical result for distinguishable par¬ 
ticles; the Fermi-Dirac distribution applies to identical fermions , and the Bose- 
Einstein distribution is for identical bosons. 

The Fermi-Dirac distribution has a particularly simple behavior as T -> 0: 

j€-n)/k B T f 0’ lf 6 < 

I oo, if 6 > p(0). 


[5.103 

[0, if € > jU,(0). 

All states are filled, up to an energy /i(0), and none are occupied for energies abo\e 
this (Figure 5.8). Evidently the chemical potential at absolute zero is precisely the 
Fermi energy: 

p(0) = e f . [5.104; 

As the temperature rises, the Fermi-Dirac distribution “softens” the cutoff, as indicated 
by the rounded curve in Figure 5.8. 

For distinguishable particles in the three-dimensional infinite square well, ue 
found (Equation 5.98) that the total energy at temperature T is 

E=^Nk B T; [5.105; 


23 See, for example, Yariv, footnote 17, Section 15.4. 
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Figure 5.8: Fermi-Dirac distribution for 7 = 0 and for T somewhat above zero. 


from Equation 5.97 it follows that 

= «(£) + ! >»£§:)]. [5.106] 

I would like to work out the corresponding formulas for identical fermions and bosons, 
using Equations 5.90 and 5.94 in place of Equation 5.86. The first constraint (Equation 
5.77) becomes 

V k 2 

N ~ 2Jr 2 J 0 e [(n 2 ky2m)-nyk B T ± { dk [5.107] 

(with the plus sign for fermions and minus for bosons), and the second constraint 
(Equation 5.78) reads 


V h 2 r°° k A 

£ = —— / —— - dk. [5.108] 

2tT 2 2m Jo e Uh 2 k 2 /2m)-ij.]/kBT ± J 

The first of these determines i.i(T), and the second determines E(T) (from the latter 
we obtain, for instance, the heat capacity C = 3E/3T ). Unfortunately, the integrals 
cannot be evaluated in terms of elementary functions, and I shall leave it for you to 
explore the matter further (see Problems 5.25 and 5.26). 


Problem 5.25 Evaluate the integrals (Equations 5.107 and 5.108) for the case of 
identical fermions at absolute zero. Compare your results with Equations 5.43 and 
5.45. (Note that for electrons there is an extra factor of 2 in Equations 5.107 and 
5.108, to account for the spin degeneracy.) 

***Problem 5.26 

(a) Show that for bosons the chemical potential must always be less than the mini¬ 
mum allowed energy. Hint: n(e) cannot be negative. 

(b) In particular, for the ideal bose gas (identical bosons in the three-dimensional 
infinite square well), (x(T) < 0 for all T. Show that in this case ii{T) mono- 
tonically increases as T decreases, assuming that N and V are held constant. 
Hint. Study Equation 5.107, with the minus sign. 
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(c) A crisis (called bose condensation) occurs when (as we lower T ) fMT) hr.' 
zero. Evaluate the integral, for p = 0, and obtain the formula for the critic- 
temperature T c at which this happens. Note : Below the critical temperature, the 
particles crowd into the ground state, and the calculational device of replacing 
the discrete sum (Equation 5.77) by a continuous integral (Equation 5.10* 
loses its validity. See F. Mandl, Statistical Physics (London: John Wile\ a. 
Sons, 1971), Section 11.5. Hint : 


(d) 


t. 



dx = T(,$)£(,$), 


[5.10- 


where Y is Euler’s gamma function and £ is the Riemann zeta function. Loo* 
up the appropriate numerical values. 


Find the critical temperature for 4 He. Its density, at this temperature, is 0.1.' 
gm/cm 3 . Note: The experimental value of the critical temperature in 4 He is 2.1 ' 
K. The remarkable properties of 4 He in the neighborhood of T c are discussed r 
the reference cited in (c). 


5.4.5 The Blackbody Spectrum 

Photons (quanta of the electromagnetic field) are identical bosons with spin 1, but the;, 
are a very special case because they are massless particles, and hence intrinsicalh 
relativistic. We can include them here, if you are prepared to accept four assertion' 
that do not really belong to nonrelativistic quantum mechanics: 

(1) The energy of a photon is related to its frequency by the Planck for mu L 
E = hv = hco. 

(2) The wave number k is related to the frequency by k = 2n/k = cofc, where . 
is the speed of light. 

(3) Only two spin states occur (the quantum number m can be +1 or —1, but no: 

0 ). 

(4) The number of photons is not a conserved quantity; when the temperature rise' 
the number of photons (per unit volume) increases. 


In view of item 4, the first constraint equation (Equation 5.77) does not appl> 
We can take account of this by simply setting a -► 0, in Equation 5.81 and everything 
that follows. Thus the most probable occupation number for photons is (Equation 


5.94) 




dk 

^ha/keT _ 1 


[5.11C 


For free photons in a box of volume V,d k is given by Equation 5.96, 24 multiplied b> 
2 for spin (item 3), and expressed in terms of co instead of k (item 2): 


24 In truth, we have no business using this formula, which came from the (nonrelativistic) Schrodingr- 
equation; fortunately, the degeneracy is exactly the same for the relativistic case. See Problem 5.3.2. 
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Figure 5.9: Planck’s formula for the blackbody spectrum, Equation 5.112. 


d k = - 


1 dco. 


[5.111] 


So the energy density N^hco/ V , in the frequency range d(o, is p(oj) da >, where 

ha) 3. 


-- 


tt 2 c 3 — l ) 


[5.112] 


This is Planck’s famous blackbody spectrum, giving the energy per unit volume, 
per unit frequency, in an electromagnetic field at equilibrium at temperature T. It is 
plotted, for three different temperatures, in Figure 5.9. 


Problem 5.27 Use Equation 5.112 to determine the energy density in the wave¬ 
length range dX. Hint: set p(co)d(o = p(X)dX , and solve for ~p(X). Derive the Wien 
displacement law for the wavelength at which the blackbody energy density is a 
maximum: 


X 


max — 


2.90 x 10" 3 mK 


[5.113] 


You’ll need to solve the transcendental equation (5 — x) = 5e x , using a calculator 
(or a computer); get the numerical answer accurate to three significant digits. 
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Problem 5.28 Derive the Stefan-Boltzmann formula for the total energy densit> 
in blackbody radiation: 

- = (T a = (7.57 x 10- 16 Jm- 3 K- 4 ) T 4 . [5.114] 

V V15 » 3 cV ' 

Hint. Use the hint in Problem 5.26(c) to evaluate the integral. Notethat£(4) = 7T 4 /90. 

FURTHER PROBLEMS FOR CHAPTER 5 _ 

Problem 5.29 Suppose you have three particles, and three distinct one-particle 
states (i Jr a (x), ifb(x), and if/Ax)) are available. How many different three-particle 
states can be constructed (a) if they are distinguishable particles, (b) if they are iden¬ 
tical bosons, and (c) if they are identical fermions? [The particles need not be in 
different states— f a (x])ir a (x 2 )f a (x 3 ) would be one possibility if the particles are 
distinguishable.] 

Problem 5.30 Calculate the Fermi energy for electrons in a ftvo-dimensional infi¬ 
nite square well. (Let o be the number of free electrons per unit area.) 

***Problem 5.31 Certain cold stars (called white dwarfs) are stabilized against grav¬ 
itational collapse by the degeneracy pressure of their electrons (Equation 5.46). As¬ 
suming constant density, the radius R of such an object can be calculated as follows: 

(a) Write the total electron energy (Equation 5.45) in terms of the radius, the number 
of nucleons (protons and neutrons) N, the number of electrons per nucleon q. 
and the mass of the electron tn. 

(b) Look up, or calculate, the gravitational energy of a uniformly dense sphere. 
Express your answer in terms of G (the constant of universal gravitation), R, A . 
and M (the mass of a nucleon). Note that the gravitational energy is negative. 

(c) Find the radius for which the total energy, (a) plus (b), is a minimum. Answer: 

/ 9jt \ 2/3 7*y/ 3 

R ~ VT/ GmM 2 N'/ r 

(Note that the radius decreases as the total mass increases !) Put in the actual 
numbers, for everything except N, using q = 1/2 (actually, q decreases a bit as 
the atomic number increases, but this is close enough for our purposes). Answer: 
R = 7.6 x 10 25 

(d) Determine the radius, in kilometers, of a white dwarf with the mass of the sun. 

(e) Determine the Fermi energy, in electron volts, for the white dwarf in (d), and 
compare it with the rest energy of an electron. Note that this system is getting 
dangerously relativistic (see Problem 5.32). 

***Problem 5.32 We can extend the theory of a free electron gas (Section 5.3.1) to 
the relativistic domain by replacing the classical kinetic energy, E = p 2 /2m, with 
the relativistic formula, E = yj p 2 c 2 + m 2 c 4 -me 2 . Momentum is related to the 
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wave vector in the usual way: p = Tik. In particular, in the extreme relativistic limit, 

E % pc — hck. 

(a) Replace h 2 k 2 /2m in Equation 5.44 by the ultrarelativistic expression, Tick, and 
calculate in this regime. 

(b) Repeat parts (a) and (b) of Problem 5.31 for the ultrarelativistic electron gas. 
Notice that in this case there is no stable minimum, regardless of R; if the 
total energy is positive, degeneracy forces exceed gravitational forces and the 
star will expand, whereas if the total is negative, gravitational forces win out 
and the star will collapse. Find the critical number of nucleons N c such that 
gravitational collapse occurs for N > N c . This is called the Chandrasekhar 
limit. Answer: 2.0 x 10 57 . What is the corresponding stellar mass (give your 
answer as a multiple of the sun’s mass). Stars heavier than this will not form 
white dwarfs, but collapse further, becoming (if conditions are right) neutron 
stars. 

(c) At extremely high density, inverse beta decay, e~ + p + n + v, converts 
virtually all of the protons and electrons into neutrons (liberating neutrinos, 
which carry off energy, in the process). Eventually neutron degeneracy pressure 
stabilizes the collapse, just as electron degeneracy does for the white dwarf (see 
Problem 5.31). Calculate the radius of a neutron star with the mass of the sun. 
Also calculate the (neutron) Fermi energy, and compare it to the rest energy of 
a neutron. Is it reasonable to treat such a star nonrelativistically? 


***Problem 5.33 


(a) Find the chemical potential and the total energy for distinguishable particles in 
the three-dimensional harmonic oscillator potential (Problem 4.39). Hint: The 
sums in Equations 5.77 and 5.78 can be evaluated exactly in this case—no need 
to use an integral approximation, as we did for the infinite square well. Note 
that by differentiating the geometric series, 


you can get 


I °o 

rb = I>- 

1 A n= 0 


(n + l)x n 


and similar results for higher derivatives. Answer: 


[5.115] 


(b) 

(c) 


E = 


3 + (\ + e -*”/*BT\ 

-NTuo _ e _ hw/ksT J 


Discuss the limiting case k B T 


[5.116] 


Discuss the classical limit, k B T » hco, in the light of the equipartition theorem 
(see, for example, Halliday and Resnick, footnote 22, Section 21-9). How many 
degrees of freedom does a particle in the three-dimensional harmonic oscillator 
possess? 



PART II 


APPLICATIONS 




CHAPTER 6 


TIME-INDEPENDENT 
PERTURBATION THEORY 


6.1 NONDEGENERATE PERTURBATION THEORY 


6.1.1 General Formulation 

Suppose we have solved the (time-independent) Schrodinger equation for some po¬ 
tential (say, the one-dimensional infinite square well): 

= [ 6 . 1 ] 

obtaining a complete set of orthonormal eigenfunctions, 

WM)=8nm, [ 6 - 2 ] 

and the corresponding eigenvalues E„. Now we perturb the potential slightly (say, 

by putting a little bump in the bottom of the well—Figure 6.1). We’d like to solve for 
the new eigenfunctions and eigenvalues: 

Hf„ = E n f n , [6.3] 

but unless we are very lucky, we’re unlikely to be able to solve the Schrodinger 
equation exactly, for this more complicated potential. Perturbation theory is a 
systematic procedure for obtaining approximate solutions to the perturbed problem 
by building on the known exact solutions to the unperturbed case. 
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Figure 6.1: Infinite square well with 
small perturbation. 


To begin with, we write the new Hamiltonian as the sum of two terms: 

H = H° + kff', [6.4; 

where H' is the perturbation. For the moment we’ll take A to be a small number; later 
we’ll crank it up to 1, and H will be the true, exact Hamiltonian. Writing f n and £, 
as power series in A, we have 

f n = fl + A^ 1 + A + [6.5 ] 

E„ = £° + A E\ + A 2 £ 2 + • • •. [6.6] 

Here E\ is the first-order correction to the n th eigenvalue, and fl is the first-order 
correction to the / 1 th eigenfunction; E 2 and fl are the second-order corrections, and 
so on. Plugging Equations 6.4, 6.5, and 6.6 into Equation 6.3, we have 

(H° + A H')[fl + Xfl + A 2 f 2 n + ...] 

= (El + A E\ + A 2 E 2 n + • • -)[fl + A ir l n + k 2 f 2 n + ••■], 
or (collecting like powers of A): 

H°fl + A (H°fl + H'fl) + A 2 (H°fl + H'f l n ) + • ■ • 

= Elfl + A (Elfl + E l n fl) + A 2 (Elf 2 n + E\fl + E 2 n fl) + ■ • •. 

To lowest order (A 0 ) this yields H°fl = Elfl, which is nothing new (just Equation 
6.1). To first order (A 1 ), 


H°f l n + H'fl == Elfl + E l n fl. [6.7] 

To second order (A 2 ), 

H°fl + H'fl = Elfl + Elf l + Elfl, [6.8] 

and so on. (I’m done with A, now—it was just a device to keep track of the different 
orders—so crank it up to 1.) 
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Figure 6.2: Constant perturbation over 
the whole well. 


6.1.2 First-Order Theory 


Taking the inner product of Equation 6.7 with [that is, multiplying by (t/r^)* and 
integrating], 

/ + (Vjff'tn) = ElWnW'n) + <*.V»°>. 

But H° is Hermitian, so 


and this cancels the first term on the right. Moreover, = 1, so 1 

E' n = {f 0 n \H f \f° n ). 


[6.9] 


This is the fundamental result of first-order perturbation theory; as a practical matter, 
it may well be the most important equation in quantum mechanics. It says that the 
first-order correction to the energy is the expectation value of the perturbation in the 
unperturbed state. 


Example. The unperturbed wave functions for the infinite square well are 
(Equation 2.24) 



Suppose first that we perturb the system by simply raising the “floor” of the well by a 
constant amount Vq (Figure 6.2). In that case ET = Vo, and the first-order correction 
to the energy of the n* state is 


El = WO W„°) = VoWM) = Vo- 


'In this context it doesn’t matter whether we write (i/r°| H'lfr®) or (with the extra 

vertical bar) because we are using the wave function itself to “label” the state. But the latter notation is 
preferable because it frees us from this specific convention. 
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Figure 6.3: Constant perturbation over 
half the well. 


The corrected energy levels, then, are E n = E® + Vq; they are simply lifted by the 
amount Vq. Of course ! The only surprising thing is that in this case the first-order 
theory yields the exact answer. Evidently, for a constant perturbation all the higher 
corrections vanish. 2 If, on the other hand, the perturbation extends only halfwa> 
across the well (Figure 6.3), then 



In this case every energy level is lifted by Vo/2. That’s not the exact result, presumably, 
but it does seem reasonable as a first-order approximation. 


Equation 6.9 is the first-order correction to the energy, to find the first-order 
correction to the wave function we first rewrite Equation 6.7: 

(H° - E J)^ 1 = ~(H r - E l n )f°. [6.10] 

The right side is a known function, so this amounts to an inhomogeneous differential 
equation for . Now, the unperturbed wave functions constitute a complete set, so 
(like any other function) can be expressed as a linear combination of them: 

ft! = E e »V»- [6-ID 

m^n 


[There is no need to include m = n in the sum, for if t/rj satisfies Equation 6.10, so 
too does (^j + orf f), for any constant a, and we can use this freedom to subtract 
off the i/fO term. 3 ] If we could determine the coefficients we’d be done. Well, 
putting Equation 6.11 into Equation 6.10, and using the fact that the satisfies the 


incidentally, nothing here depends on the specific nature of the infinite square well—the same 
result applies for any potential, when the perturbation is constant. 

3 Alternatively, a glance at Equation 6.5 reveals that any t/r° component in i^] might as well be 
pulled out and combined with the first term. We are only concerned, for the moment, with solving the 
Schrodinger equation (Equation 6.3), and the we get will not, in general, be normalized. 
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unperturbed Schrodinger equation (Equation 6.1), we have 

= -(//' - 

m^n 


Taking the inner product with ^°, 

Wl) = -W?\H'\f°) + E'jf?\f°). 


If / = n, the left side is zero, and we recover Equation 6.9; if l ^ n, we get 

(£? - El)cf = 


or 


C ” £0 _ £0 


[ 6 . 12 ] 


SO 


*.‘ = E 

m^n 


,Q 

(J?J - 


[6.13] 


Notice that the denominator is safe, since there is no coefficient with m = n, as long as 
the unperturbed energy spectrum is nondegenerate. But if two different unperturbed 
states share the same energy, we’re in serious trouble (we divided by zero to get 
Equation 6.12); in that case we need degenerate perturbation theory, which I’ll 
come to in Section 6.2. 

That completes first-order perturbation theory: E\ is given by Equation 6.9, 
and is given by Equation 6.13. I should warn you that whereas perturbation theory 
often yields surprisingly accurate energies (that is, E° + E\ is quite close to the exact 
value E n ), the wave functions are notoriously poor. 


❖Problem 6.1 Suppose we put a delta-function bump in the center of the infinite 
square well: 

H' =aS(x-a/ 2), 

where a is a constant. Find the first-order correction to the allowed energies. Explain 
why the energies are not perturbed for even n. 


❖Problem 6.2 For the harmonic oscillator [V (x) = (l/2)foc 2 ], the allowed energies 
are 

E n ~ (n + 1/2 )fuo, (n = 0, 1, 2, ...), 

where to — y/k/m is the classical frequency. Now suppose the spring constant 
increases slightly: k -> (1 + e)k. (Perhaps we cool the spring, so it becomes less 
flexible.) 
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(a) Find the exact new energies (trivial, in this case). Expand your formula a» _ 
power series in e, up to second order. 

(b) Now calculate the first-order perturbation in the energy, using Equation 6> 
What is H f here? Compare your result with part (a). Hint : It is not necessan - 
in fact, it is not permitted—to calculate a single integral in doing this problem 

Problem 6.3 Two identical bosons are placed in an infinite square well (Equation 

2.15). They interact weakly with one another, via the potential 

V (*i, x 2 ) = -a V 0 8(x x - x 2 ) 

(where Vq is a constant with the dimensions of energy and a is the width of the well 

(a) First, ignoring the interaction between the particles, find the ground state anc 
first excited state—both the wave functions and the associated energies. 

(b) Use first-order perturbation theory to calculate the effect of the particle-particle 
interaction on the ground and first excited state energies. 


6.1.3 Second-Order Energies 


Proceeding as before, we take the inner product of the second -order equation (Equa¬ 
tion 6.8) with 

<*»°l#V„ 2 > + (fl\H't\) - + £>»>„'> + E 2 n {fM). 

Again, we exploit the Hermiticity of //°: 

so the first term on the left cancels the first term on the right. Meanwhile, = 1- 

and we are left with a formula for E 1 - 


But 


El = - ElifM). 

If, 1 ) = = o, 


El = = E 


or, finally. 


£ « 2 = E 




E2 - El 


[6.14] 
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This is the fundamental result of second-order perturbation theory. We could proceed 
to calculate the second-order correction to the wave function (t/^X the third-order 
correction to the energy, and so on, but in practice Equation 6.14 is ordinarily as high 
as it is useful to pursue this method. 

**Problem 6.4 

(a) Find the second-order correction to the energies (E 2 ) for the potential in Problem 
6.1. Note: You can sum the series explicitly to obtain the result —2m{a/7thn) 2 , 
for odd n. 

(b) Calculate the second-order correction to the ground-state energy (Eq) for the 
potential in Problem 6.2. Check that your result is consistent with the exact 
solution. 

**Problem 6.5 Consider a charged particle in the one-dimensional harmonic oscil¬ 
lator potential. Suppose we turn on a weak electric field (E) so that the potential 
energy is shifted by an amount H' = —qEx. 

(a) Show that there is no first-order change in the energy levels, and calculate the 
second-order correction. Hint: See Problem 3.50. 

(b) The Schrodinger equation can be solved exactly in this case by a change of 
variables: x' = x — (qE/mto 2 ). Find the exact energies, and show that they are 
consistent with the perturbation theory approximation. 


6.2 DEGENERATE PERTURBATION THEORY _ 

If the unperturbed states are degenerate—that is, if two (or more) distinct states 
(i/r° and V'f) share the same energy—then ordinary perturbation theory fails: 
(Equation 6.12) and E 2 (Equation 6.14) blow up (unless, possibly, the numerator 
vanishes, — 0—a loophole that will be important to us later on). In the 

degenerate case, therefore, there is no reason to trust even the first -order correction 
to the energy (Equation 6.9), and we must look for some other way to handle the 
problem. 

6.2.1 Twofold Degeneracy 
Suppose that 

ffV° = EV“, and<^V?)=0. [6.15] 

Note that any linear combination of these states, 


[6.16] 
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is still an eigenstate of H°, with the same eigenvalue E°: 

//V° = £V°. [6.r; 

Typically, the perturbation ( H') will “break” the degeneracy: As we increase X (from 
0 to 1), the common unperturbed energy E° splits into two (Figure 6.4). 

The essential problem is this: When we turn off the perturbation, the “upper” state 
reduces down to one linear combination of and and the “lower” state reduce^ 
to some other linear combination, but we don’t know a priori what these “good” 
linear combinations will be. For this reason we can’t even calculate the first-order 
energy (Equation 6.9) because we don’t know what unperturbed states to use. 

For the moment, therefore, let’s just write the “good” unperturbed states in the 
general form (Equation 6.16), keeping a and ft adjustable. We want to solve the 
Schrodinger equation, 

Hf = Eijr, [6.181 

with H = H° + XH' and 

E = E° + kE l + X 2 E 2 + ■ • •, rfr = + Xf l + X 2 f 2 + • ■ •. [6.19] 

Plugging these into Equation 6.18, and collecting like powers of X, as before, we find 

#V° + X{H r f Q + i/V) + • • • = £ v + X(E V° + ^V 1 ) + • • • • 

But #V° = (Equation 6.17), so the first terms cancel; at order A 1 we have 

= E°fi l + £V°. [6.20] 

Taking the inner product with \fr®: 

Wa\ #V> + tf>°> = E°(tf\1r l ) + E l (^\ir°). 

Because H {] is Hermitian, the first term on the left cancels the first term on the right. 
Putting in Equation 6.16 and exploiting the orthonormality condition Equation 6.15. 
we obtain 

+ MaWWl) = aE\ 



Figure 6.4: “Lifting” of a degeneracy by a 
perturbation. 
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or, more compactly, 

CcWaa + PWab = OlE\ [6.21] 

where 

Wij = ( i , j = a, b ). [ 6 . 22 ] 

Similarly, the inner product with f b yields 

cxW ba + ftW bb = ftE x . [6.23] 

Notice that the W ’s are (in principle) known —they are just the “matrix elements” 
of H\ with respect to the unperturbed wave functions and f b . Multiplying 
Equation 6.23 by W ab , and using Equation 6.21 to eliminate ftW a b, we find 

a[W ab Wba - (E l - W aa )(E 1 - W bb )\ = 0. [6-24] 

If a is not zero, Equation 6.24 yields an equation for E l : 

(E 1 ) 2 - E\Waa + Wbb) + (WaaWbb - W ab W ba ) = 0. [6.25] 

Invoking the quadratic formula, and noting (from Equation 6.22) that W ba = W* b , 
we conclude that 

E' ± = X - \w aa + W bb ± b /(W aa -W bb ) 1 + 4\Wab\ 2 ] • [6-26] 


This is the fundamental result of degenerate perturbation theory; the two roots corre¬ 
spond to the two perturbed energies. 

But what if a is zero? In that case ft = 1, Equation 6.21 says W ab ~ 0, 
and Equation 6.23 gives E 1 = W bb . This is actually included in the general result 
(Equation 6.26), with the plus sign (the minus sign corresponds to a — 1, ft — 0). 
What’s more, the answers , 

E\ = w bb = (V^l tf'IV'"}, E- = W™ = wlw'wl), 

are precisely what we would have obtained using nondegenerate perturbation theory 
(Equation 6.9)—we have simply been lucky: The states ^ and were already the 
“correct” linear combinations. Obviously, it would be greatly to our advantage if we 
could somehow guess the “good” states right from the start. In practice, we can often 
do so by exploiting the following theorem: 

Theorem: Let A be a Hermitian operator that commutes with H'. If and 
f b are eigenfunctions of A with distinct eigenvalues, 

A= Ufa, Afl = vfl, and 

then Wab = 0 (and hence f" and are the “good” states to use in perturbation 
theory). 
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Proof: By assumption, [A, H '] = 0, so 


h'w°) 


o 

(ft - = (/i - v)W ai . 


But /i / u, so W ab = 0. QED 


Moral: If you’re faced with degenerate states, look around for some Hermitian 
operator A that commutes with H'\ pick as your unperturbed states ones that are si¬ 
multaneously eigenfunctions of H° and A. Then use ordinary first-order perturbation 
theory. If you can’t find such an operator, you’ll have to resort to Equation 6.26, but 
in practice this is seldom necessary. 

Problem 6.6 Let the two “good” unperturbed states be 

fl = a ±fa + Mb> 

where a± and are determined (up to normalization) by Equation 6.21 (or Equation 
6.23), with Equation 6.26 for E±. Show explicitly that 

(a) iare orthogonal ({i/^l f®} =0); 

(b) Wl\H'\i,°_)=0-, 

(C) = El 

Problem 6.7 Consider a particle of mass m that is free to move in a one-dimensional 
region of length L that closes on itself (for instance, a bead which slides frictionlessly 
on a circular wire of circumference L \ Problem 2.43). 

(a) Show that the stationary states can be written in the form 


fn(x) = -—e Zninx / L , (-L/2 < X < Lf 2 ), 

Vi 

where n = 0, ±1, ± 2 , ..., and the allowed energies are 

*(=?)'■ 

Notice that—with the exception of the ground state (n = 0)—these are all 
doubly degenerate. 

(b) Now suppose we introduce the perturbation 
H' = 
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where a <g; L. (This puts a little “dimple” in the potential at x = 0, as though 
we bent the wire slightly to make a “trap.”) Find the first-order correction to 
E n , using Equation 6.26. Hint: To evaluate the integrals, exploit the fact that 
a < L to extend the limits from ±L jl to ±oo; after all, H’ is essentially zero 
outside —a<x<a. 

(c) What are the “good” linear combinations of \js n and for this problem? Show 

that with these states you get the first-order correction using Equation 6.9. 

(d) Find a Hermitian operator A that fits the requirements of the theorem, and show 
that the simultaneous eigenstates of H° and A are precisely the ones you found 
in (c). 


6.2.2 Higher-Order Degeneracy 

In the previous section I assumed the degeneracy was twofold, but it is easy to see 
how the method generalizes. Rewrite Equations 6.21 and 6.23 in matrix form: 

( W aa (“1 =£' (“V [6.27] 

w bb )\p) W 

Evidently the E u s are nothing but the eigenvalues of the W- matrix; Equation 6.25 
is the characteristic equation (Equation 3.70) for this matrix, and the “good” linear 
combinations of the unperturbed states are the eigenvectors of IF. In the case of 
n-fold degeneracy, we look for the eigenvalues of the n x n matrix 

W,j = [6.281 


In the language of linear algebra, finding the “good” unperturbed wave functions 
amounts to constructing a basis in the degenerate subspace that diagonalizes the 
perturbation H'. Once again, if you can think of an operator A that commutes with 
H r , and use the simultaneous eigenfunctions of A and H°, then the W matrix will 
automatically be diagonal, and you won’t have to fuss with solving the characteristic 
equation. 


Example. Consider the three-dimensional infinite cubical well (Problem 4.2): 


V(x,y, z) 


0, if 0 < x < a, 0 < y < a, and 0 < z < a; 
oo otherwise. 


[6.29] 


The stationary states are 

^n x n y n z (*> z ) 



/n x n \ . (n y 7t \ . / n z 7t \ 

{—■ 9 sin (.—■ y ) sin (— j v- 


[6.30] 
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Figure 6.5: The perturbation increases the 
potential by an amount Vq in the shaded sector. 


where n x , n y , and n z are positive integers. The corresponding allowed energies are 


W (n * + n r + n >)• 

Notice that the ground state (i^m) is nondegenerate; its energy is 

£° = 3^ 

0 2ma 2 ' 

But the first excited state is (triply) degenerate: 

^=^112, ^ = 1^121, and ^T c s ^211, 

all share the energy 


7r 2 a 2 

ma 2 


£? = y-_ 2 . 


Now let’s introduce the perturbation 


jj, _ Vq, if 0 < x < a/2 and 0 < y < a/2\ 
0, otherwise. 


[6.31] 


[6.32] 


[6.33] 


[6.34] 


[6.35] 


This raises the potential by an amount V 0 in one quarter of the box (see Figure 6.5). 
The first-order correction to the ground state energy is given by Equation 6.9: 

4 = = (2) Vq sin 2 (“*) dx 

° [6.36] 

r sm2 &) dy i sin2 (r) rfz= i ,/ »’ 

which is just what we would expect. 

For the first excited state we need the full machinery of degenerate perturbation 
theory. The first step is to construct the matrix W. The diagonal elements are the 
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same as for the ground state (except that the argument of one of the sines is doubled); 
you can check for yourself that 

1 

W aa = W bb = W cc = -V 0 . 

The off-diagonal elements are more interesting: 

w ° b = (!) v °l' ™ 2 &) ix 

r ^ (r) ^ (t v ) dy i‘ sin (t z ) sin (r) 

But the z integral is zero (as it will be also for W ac ), so 
Wab = W ac = 0. 


Finally, 


Thus 



[6.37] 


where k = (8/3 it') 2 ^ 0.7205. 

The characteristic equation for W (or rather, for 4 W/ Vo , which is easier to work 
with) is 

(1 - w) 3 - k 2 ( 1 - w) = 0, 


and the eigenvalues are 

16'I - 1; u> 2 = 1 + K % 1.7205; w 3 = 1 - k ^ 0.2795. 


To first order in k, then, 


E l (k) = 


E® + XVq/4, 
£J+M1+k)Fo/4, 
£° + M1-k)Fo/4, 


[6.38] 


where ^ is the (common) unperturbed energy (Equation 6.34). The perturbation 
lifts the degeneracy, splitting E J into three distinct energy levels (see Figure 6.6). 
Notice that if we had naively applied nondegenerate perturbation theory to this prob¬ 
lem, we would have concluded that the first-order correction (Equation 6.9) is the 



234 


Chap. 6 Time-Independent Perturbation Theory 



Figure 6.6: Lifting of the degeneracy in 
the example (Equation 6.38). 


same for all three states, and equal to Vo/4 —which is actually correct only for the 
middle state. 

Meanwhile, the “good” unperturbed states are linear combinations of the form 
f° = ctf a + ftf b + yf c , [6.39 


where the coefficients (a, ft, and y) form the eigenvectors of the matrix W : 


(si ■:)(;)-•(;)■ 


Forte = 1 we get a = 1, ft = y — 0; for w = 1 ± k we get a = 0, ft — ±y = \/-Jl. 
(I normalized them as I went along.) Thus the “good” states are 4 


fa, 

(tb + f c )/V2, 

Wb - VO/V2. 


[6.40] 


Problem 6.8 Suppose we perturb the infinite cubical well (Equation 6.29) b\ 
putting a delta-function “bump” at the point (a/4, a/2, 3a/4): 

H' = a 3 V 0 S(x - a/4)S(y - a/2)S(z - 3a/4). 

Find the first-order corrections to the energy of the ground state and the (triply de¬ 
generate) first excited states. 


4 We might have guessed this result right from the start by noting that the operator P xy , which 
interchanges jc and y, commutes with H'. Its eigenvalues are +1 (for functions that are even under the 
interchange) and -1 (for functions that are odd). In this case i fr a is already even, (i fo + \fr c )is even, and 
6 h ~ iAc) is odd. However, this is not quite conclusive, since any linear combination of the even states 
would still be even. What we’d really like is an operator with three distinct eigenvalues in the degenerate 
subspace. 
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♦Problem 6.9 Consider a quantum system with just three linearly independent states. 

The Hamiltonian, in matrix form, is 

/(l—e) 0 0\ 

h = vA o i € J , 

\ 0 6 2 / 

where Vq is a constant and € is some small number (6 <§; 1). 

(a) Write down the eigenvectors and eigenvalues of the unperturbed Hamiltonian 
(6 = 0 ). 

(b) Solve for the exact eigenvalues of H. Expand each of them as a power series in 
c, up to second order. 

(c) Use first- and second-order nondegenerate perturbation theory to find the ap¬ 
proximate eigenvalue for the state that grows out of the nondegenerate eigen¬ 
vector of H°. Compare the exact result from (b). 

(d) Use degenerate perturbation theory to find the first-order correction to the two 
initially degenerate eigenvalues. Compare the exact results. 


6.3 THE FINE STRUCTURE OF HYDROGEN 


In our study of the hydrogen atom (Section 4.2), we took the Hamiltonian to be 


h 2 




2m 


V 2 


e 2 1 
4jt6o r 


[6.41] 


(electron kinetic energy plus Coulombic potential energy). But this is not quite the 
whole story. We have already learned how to correct for the motion of the nucleus: 
Just replace m by the reduced mass (Problem 5.1). More significant is the so-called 
fine structure, which is actually due to two distinct mechanisms: a relativistic 
correction, and spin-orbit coupling. Compared to the Bohr energies (Equation 
4.70), fine structure is a tiny perturbation—smaller by a factor of a 2 , where 


1 


' Ancohc 137.036 


[6.42] 


is the famous fine structure constant. Smaller still (by another factor of a) is the 
Lamb shift, associated with the quantization of the Coulomb field, and smaller by yet 
another order of magnitude is the hyperfine structure, which is due to the magnetic 
interaction between the dipole moments of the electron and the proton. This hierarchy 
is summarized in Table 6.1. 
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Table 6.1: Hierarchy of corrections to the Bohr energies of hydrogen. 


Bohr energies: of order a 2 mc 2 

Fine structure: of order a 4 mc 2 

Lamb shift: of order a 5 me 2 

Hyperfine splitting: of order ( mjm p )a 4 mc 2 


In the present section we will analyze the fine structure of hydrogen, as an application 

of time-independent perturbation theory. 

Problem 6.10 

(a) Express the Bohr energies in terms of the fine structure constant and the rest 
energy (me 2 ) of the electron. 

(b) Calculate the fine structure constant from first principles (i.e. without recourse 
to the empirical values of e 0 , e, ft , and c). Note : The fine structure constant is un¬ 
doubtedly the most fundamental pure (dimensionless) number in all of physics: 
It relates the basic constants of electromagnetism (the charge of the electron), 
relativity (the speed of light), and quantum mechanics (Planck’s constant). If 
you can solve part (b), you have the most certain Nobel Prize in history waiting 
for you. But I wouldn’t recommend spending a lot of time on it right now; mans 
smart people have tried and given up. 


6.3.1 The Relativistic Correction 


The first term in the Hamiltonian is supposed to represent kinetic energy: 

1 ? P 2 

T = -mv 2 = —, [6.43] 

and the canonical substitution p— ► ( Ti/i)V yields the operator 


f = 



[6.44] 


But Equation 6.43 is the classical equation for kinetic energy; the relativistic formula 
is 


T = 


me 2 

y/\ ~ ( V/C ) 2 


— me 2 . 


[6.45] 


The first term is the total relativistic energy (not counting potential energy, which 
we aren’t concerned with at the moment), and the second term is the rest energy— 
the difference is the energy attributable to motion. We need to express T in terms of the 
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(relativistic) momentum, 


mv 

V 1 “ ( v / c ) 2 ’ 


instead of velocity. Notice that 


[6.46] 


so 


m 2 v 2 c 2 + m 2 c 4 [ 1 - (v/c) 2 '] _ mV _ 22 

1 - (u/c) 2 1 - (v/c) 2 mC 

T = y/ p 2 c 2 + mV — me 2 . [6.47] 


This relativistic equation for kinetic energy reduces (of course) to the classical result 
(Equation 6.43), in the nonrelativistic limit p mc\ expanding in powers of the 
small number (p/me), we have 



The lowest-order 5 relativistic contribution to the Hamiltonian is evidently 


H' r = — 


P 

8 mV 


[6.49] 


In first-order perturbation theory, the correction to E„ is given by the expectation 
value of H f in the unperturbed state (Equation 6.9): 


Now the Schrodinger equation (for the unperturbed states) says 
p 2 ij/ = 2m(E - V)yff, 


and hence 6 


E: = - 


2m c 2 


((E- 


vf) = -^— 2 IE 2 

2 me 1 


-2 E(V) + (V 2 )]. 


[6.50] 

[6.51] 

[6.52] 


5 The kinetic energy of the electron in hydrogen is on the order of 10 eV, which is miniscule 
compared to its rest energy (511,000 eV), so the hydrogen atom is basically nonrelativistic, and we can 
afford to keep only the lowest-order correction. In Equation 6.48, p is the relativistic momentum (Equation 
6.46), not the classical momentum mv. It is the former that we now associate with the quantum operator 
—ihV, in Equation 6.49. 

6 There is some sleight-of-hand in this maneuver, which exploits the Hermiticity of /r and of 
(E — V). In truth, the operator // is not Hermitian, for states with / = 0, and the applicability of 
perturbation theory to Equation 6.49 is therefore called into question. Fortunately, the exact solution is 
available; it can be obtained by using the (relativistic) Dirac equation in place of the (nonrelativistic) 
Schrodinger equation, and it confirms the results we obtain here by less rigorous means. (See Problem 
6.17.) 
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So far, this is entirely general; but we’re interested in the case of hydrogen, for which 
V(r) = -(l/47t€ 0 )e 2 /r: 


E' = - 


2m c 2 


- iE *{Ly 


0/ 


' : U<. : 1 


[6.5? 


where E„ is the Bohr energy of the state in question. 

To complete the job, we need the expectation values of 1 jr and 1 / r 2 in the 
(unperturbed) state \jf n i m (Equation 4.89). The first is easy (see Problem 6.11): 


where a is the Bohr radius (Equation 4.72). The second is not so simple to derive 
(see Problem 6.28), but the answer is 7 



1 

(/+ 1/2 )n 2 a 2 ' 


[6.5 5. 


It follows that 


E, 


1 

2 me 2 


E 2 + 2 E n 


_L + (_fL) 2 _!_ 

\47reo/ n 2 a \4^6o / (/ + l/2)w 3 « 2 


or, eliminating a (using Equation 4.72) and expressing everything in terms of E r 
(using Equation 4.70), 


El = 


JL [_!=__ 3 

2mc 2 ]_/ + 1/2 


[6.56] 


Notice that the relativistic correction is smaller than E n by a factor of E n jmc 2 
2 x 10- 5 . 

You might have noticed that I used wendegenerate perturbation theory in this 
calculation even though the hydrogen atom is highly degenerate. But the pertur¬ 
bation is spherically symmetrical, so it commutes with L 2 and L z . Moreover, the 
eigenfunctions of these operators (taken together) have distinct eigenvalues for the 
n 2 states with a given E n . Luckily, then, the wave functions \(r n i m are “good” states 
for this problem, so as it happens the use of nondegenerate perturbation theory was 
legitimate. 


^Problem 6.11 Use the virial theorem (Problem 4.41) to prove Equation 6.54. 


7 The general formula for the expectation value of any power of r is given in Hans A. Bethe and 
Edwin E. Salpeter, Quantum Mechanics of One- and Two-Electron Atoms, (New York: Plenum, 1977). 
p. 17. 
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Problem 6.12 In Problem 4.43, you calculated the expectation value of r s in the 
state 1 /^ 321 . Check your answer for the special cases 5=0 (trivial), s = — 1 (Equation 
6.54), s = — 2 (Equation 6.55), and s — — 3 (Equation 6.63). Comment on the case 
5 = -7. 

**Problem 6.13 Find the (lowest-order) relativistic correction to the energy levels of 
the one-dimensional harmonic oscillator. Hint: Use the technique of Problem 2.37. 


6.3.2 Spin-Orbit Coupling 

Imagine the electron in orbit around the nucleus; from the electron’s point of view, 
the proton is circling around it (Figure 6.7). This orbiting positive charge sets up 
a magnetic field B in the electron frame, which exerts a torque on the spinning 
electron, tending to align its magnetic moment (jjl) along the direction of the field. 
The Hamiltonian (Equation 4.157) is 

H = -\L B. [6.57] 


The Magnetic Field of the Proton. If we picture the proton (from the elec¬ 
tron’s perspective) as a continuous current loop (Figure 6.7), its magnetic field can 
be calculated from the Biot-Savart law: 

2 r 


with an effective current / = e/T , where e is the charge of the proton and T is the 
period of the orbit. On the other hand, the orbital angular momentum of the electron 
(in the rest frame of the nucleus ) is L = rmv = Inmr 2 jT . Moreover, B and L point 
in the same direction (up, in Figure 6.7), so 


B = 


1 


tL. 


4tt€o mc 2 r 3 

(I used c = l/V^oho to eliminate in favor of €q.) 


[6.58] 



Figure 6.7: Hydrogen atom, from the 
electron’s perspective. 
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Figure 6.8: A ring of charge, rotating about its axis. 


The Magnetic Dipole Moment of the Electron. The magnetic dipole mo¬ 
ment of a spinning charge is related to its (spin) angular momentum; the proportional 
ity factor is the gyromagnetic ratio (which we already encountered in Section 4.4.2 
Let’s derive it, using classical electrodynamics. Consider first a charge q smearec 
out around a ring of radius r , which rotates about the axis with period T (Figure 6.8 
The magnetic dipole moment of the ring is defined as the current ( q/T) times the 
area (nr 2 ): 


qnr 2 
P - T 

If the mass of the ring is m, its angular momentum is the moment of inertia (mr 
times the angular velocity (2n j T): 


2nmr 2 


The gyromagnetic ratio for this configuration is evidently p/S = q/2m. Notice 
that it is independent of r (and T). If I had some more complicated object, such a- 
a sphere (all I require is that it be a figure of revolution, rotating about its axis), 1 
could calculate p and S by chopping it into little rings and adding their contributions 
As long as the mass and the charge are distributed in the same manner (so that the 
charge-to-mass ratio is uniform), the gyromagnetic ratio will be the same for each 
ring, and hence also for the object as a whole. Moreover, the directions of p and S 
are the same (or opposite, if the charge is negative), so 


fi = 


( 4 ) 


s. 


That was a purely classical calculation, however; as it turns out, the electron’s mag¬ 
netic moment is twice the classical answer: 


Pe 


—s. 

m 


[6.59] 
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The “extra” factor of 2 was explained by Dirac in his relativistic theory of the electron. 8 


The Spin-Orbit Interaction. Putting all this together, we have 


H = 



1 


S L. 


But there is a serious fraud in this calculation: I did the analysis in the rest frame 
of the electron, but that’s not an inertial system —it accelerates as the electron orbits 
around the nucleus. You can get away with this if you make an appropriate kinematic 
correction, known as the Thomas precession. 9 In this context it throws in a factor of 
1 / 2 : 

= ( ^—) -TTT S ' L [6.60] 

so \S7T€oJ m 2 c 2 r 2 

This is the spin-orbit interaction; apart from two corrections (the modified gyromag- 
netic ratio for the electron and the Thomas precession factor—which, coincidentally, 
exactly cancel one another), it is just what you would expect on the basis of a naive 
classical model. Physically, it is attributable to the torque exerted on the magnetic 
dipole moment of the spinning electron, by the magnetic field of the proton, in the 
electron’s instantaneous rest frame. 

Now the quantum mechanics. In the presence of spin-orbit coupling, the Hamil¬ 
tonian no longer commutes with L and S, so the spin and orbital angular momenta 
are not separately conserved (see Problem 6.14). However, H^ 0 does commute with 
L 2 , S 2 , and the total angular momentum 


J = L + S, [6.61] 

and hence these quantities are conserved (Equation 3.148). To put it another way, the 
eigenstates of L z and S z are not “good” states to use in perturbation theory, but the 
eigenstates of L 2 , S 2 , J 2 , and J z are. Now 

J 2 = (L+S) ■ (L+S) = L 2 + S 2 + 2L ■ S, 


8 We have already noticed that it can be dangerous to picture the electron as a spinning sphere 
(see Problem 4.26), and it is not too surprising that this naive classical model gets the gyromagnetic ratio 
wrong. Incidentally, quantum electrodynamics reveals tiny corrections to Equation 6.59; the calculation 
of the so-called anomalous magnetic moment of the electron stands as one of the greatest achievements 
of theoretical physics. 

9 One way of thinking of it is that we are continually stepping from one inertial system to another; 
Thomas precession amounts to the cumulative effect of all these Lorentz transformations. We could avoid 
this problem, of course, by staying in the lab frame, in which the nucleus is at rest. In that case the field of 
the proton is purely electric , and you may well wonder why it exerts any torque on the electron. Well, the 
fact is that a moving magnetic dipole acquires an electric dipole moment, and in the lab frame the spin-orbit 
coupling is due to the interaction of the electric field of the nucleus with the electric dipole moment of 
the electron. Because this analysis requires more sophisticated electrodynamics, it seems best to adopt 
the electron’s perspective, where the physical mechanism is more transparent. For a related discussion, see 
V. Namias, Am. J. Phys., 57, 171 (1989). 
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so 

L S = ^(J 2 - L 2 - S 2 ), [6.62 

and therefore the eigenvalues of L ■ S are 


+ 1) — Kl + 1) — s ( s +1)3- 

In this case, of course, s = 1/2. Meanwhile, the expectation value of 1/r 3 (see 
Problem 6.30) is 

<4> = , , ,1 , tee?: 


and we conclude that 

4 , = < 4 > = 


/(/ + l/ 2 )(/+ 1 )rc 3 tf 3 ’ 

e 2 L ( h 2 /2)U(.j + 1) - /(/ + 1) - 3/4] 


8^60 m 2 c 2 /(/ + 1 / 2 )(/ + 1 )n 3 a 3 

or, expressing it all in terms of E n : 

El \n[j(j +!)-/(/+!) -3/4]] 


EL = - 


1(1 + 1 / 2 ) (/ + 1 ) 


[6.64] 


It is remarkable, considering the totally different physical mechanisms in¬ 
volved, that the relativistic correction and the spin-orbit coupling are of the same 
order {Elfme 2 ). Adding them together, we get the complete fine-structure formula 
(see Problem 6.15): 


Ei = 


2 me 2 


3- 


4 n \ 

7+vv 


[6.65] 


Combining this with the Bohr formula, we obtain the grand result for the energy levels 
of hydrogen, including fine structure: 



[ 6 . 66 ] 


Fine structure breaks the degeneracy in / (that is, for a given n, the different allowed 
values of / do not all carry the same energy); the energies are determined by n and / 
(see Figure 6.9). The azimuthal eigenvalues for orbital and spin angular momentum 
{mi and m s ) are no longer “good” quantum numbers—the stationary states are linear 
combinations of states with different values of these quantities; the “good” quantum 
numbers are n, /, s, j, and m ,. 10 


10 To write | jrrij) (for given / and s) as a linear combination of \lmi)\sm s ) we would use the 
appropriate Clebsch-Gordan coefficients (Equation 4.185). 
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n = 


3 





7/2 

5/2 

3/2 

1/2 




= 3/2 
y - 1/2 


X:; 


- V . j= 1/2 

/ = 0 /=1 / = 2 /= 3 

(S) (P) (D) (F) 

Figure 6.9: Energy levels of hydrogen, including fine structure {not to scale). 


Problem 6.14 Evaluate the following commutators: (a) [L • S, L], (b) [L • S, S], 
(c) [L • S, J], (d) [L • S, L 2 ], (e) [L ■ S, S 2 ], (f) [L • S, J 2 ]. Hint : L and S satisfy the 
fundamental commutation relations for angular momentum (Equations 4.98, 4.99, 
and 4.134), but they commute with each other. 

*Problem6.15 Derive the fine structure formula (Equation 6.65) from the relativis¬ 
tic correction (Equation 6.56) and the spin-orbit coupling (Equation 6.64). Hint: Note 
that j = l ± 1/2; treat the plus sign and the minus sign separately, and you’ll find 
that you get the same final answer either way. 

** Problem 6.16 The most prominent feature of the hydrogen spectrum in the visible 
region is the red Balmer line, coming from the transition n ~ 3 to n = 2. First 
of all, determine the wavelength and frequency of this line, according to the Bohr 
theory. Fine structure splits this line into several closely spaced lines; the question 
is: How many, and what is their spacing ? Hint: First determine how many sublevels 
the n = 2 level splits into, and find E fs for each of these, in eV. Then do the same for 
n = 3. Draw an energy level diagram showing all possible transitions from n = 3 
to n = 2. The energy released (in the form of a photon) is (E 3 - E 2 ) + A E, the 
first part being common to all of them, and the AE (due to fine structure) varying 
from one transition to the next. Find A E (in eV) for each transition. Finally, convert 
to photon frequency, and determine the spacing between adjacent spectral lines (in 
Hz)— not the frequency interval between each line and the unperturbed line (which 
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is, of course, unobservable), but the frequency interval between each line and the nex: 
one. Your final answer should take the form, “The red Balmer line splits into (??'" 
lines. In order of increasing frequency, they come from the transitions (1) j = (?? 1 
to j = (???), (2) j — (???) to j — (???), and so on. The frequency spacing between 
line (1) and line (2) is (???) Hz, the spacing between line (2) and line (3) is (???) Hz. 
and so on.” 


Problem 6.17 The exact fine-structure formula for hydrogen (obtained from the 
Dirac equation without recourse to perturbation theory) is 11 


Eni = me 


1 + 


C - O' ■ 


- 1/2 


j + 1/2) + J(j + l/2) 2 -a 2 ) 

Expand to order a 4 (noting that a < 1), and show that you recover Equation 6.66. 


6.4 THE ZEEMAN EFFECT _ 

When an atom is placed in a uniform external magnetic field B ex t, the energy levels 
are shifted. This phenomenon is known as the Zeeman effect. For a single electron, 
the perturbation is 

H' 2 = ~(P'I + P's) ' ®ext» [6.67] 

where 

= [ 6 . 68 ] 

m 

is the magnetic dipole moment associated with electron spin, and 

m = ~L [6.69] 

is the dipole moment associated with orbital motion. 12 Thus 

H' z — (L + 2S) • B ext . [6.70] 

2m 

The nature of the Zeeman splitting depends critically on the strength of the 
external field in comparison with the internal field (Equation 6.58) that gives rise 
to spin-orbit coupling. For if B cu < B inU then fine structure dominates, and H z 
can be treated as a small perturbation, whereas if 5 ext > B mt , then the Zeeman 


^Bethe and Salpeter (footnote 7) page 83. 

12 The gyroraagnetic ratio for orbital motion is just the classical value (q/2m )—it is only for spin 
that there is an “extra” factor of 2. 
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effect dominates, and fine structure becomes the perturbation. In the intermediate 
zone, where the two fields are comparable, we need the full machinery of degenerate 
perturbation theory, and it is necessary to diagonalize the relevant portion of the 
Hamiltonian “by hand”. In the following sections we shall explore each of these 
regimes briefly, for the case of hydrogen. 

Problem 6.18 Use Equation 6.58 to estimate the internal field in hydrogen, and 
characterize quantitatively a “strong” and “weak” Zeeman field. 


6.4.1 Weak-Field Zeeman Effect 


If i? ext <<C B{ nt , fine structure dominates (Equation 6.66); the “good” quantum num¬ 
bers are n, /, j, and mj (but not w/ and m s , because—in the presence of spin-orbit 
coupling—L and S are not separately conserved). In first-order perturbation theory, 
the Zeeman correction to the energy is 

E' z = (nljmj\H' z \nljmj) = ^B„ t • <L + 2S). [6.71] 

Now L + 2S = J + S; unfortunately, we do not immediately know the expectation 
value ofS. But we can figure it out as follows: The total angular momentum J = L+S 
is constant (Figure 6.10); L and S precess rapidly about this fixed vector. In particular, 
the (time) average value of S is just its projection along J: 


S -(U>I F6 791 

®ave — j2 *»• [6.72J 

But L = J-S, so L 2 = J 2 + S 2 — 2J • S, and hence 

1 h 2 

S - J = -(J 2 + S 2 - L 2 ) = -UU + 1) + s(s + 1) - 1(1 + 1)], [6.73] 

from which it follows that 


<L + 2S) = {(l + 5-i)j)= i- 


j(j +!)-/(/ + !) + 3/4 

2 JU + 1) 


(J). [6.74] 



Figure 6.10: In the presence of spin-orbit 
coupling, L and S are not separately 
conserved; they precess about the fixed 
total angular momentum, J. 
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Figure 6.11: Weak-field Zeeman 
splitting of the ground state; the 
upper line (mj = 1/2) has slope 1, tht- 
lower line (mj = —1/2) has slope — I 


The term in square brackets is known as the Lande g-factor, gj. 

We may as well choose the z-axis to lie along B ex t; then 

E l z = fiBgjB ext mj, [6.~5 


where 

n B =*L-= 5.788 x 1(T 5 eV/T [6.76 

2m 

is the so-called Bohr magneton. The total energy is the sum of the fine-structure 
part (Equation 6.66) and the Zeeman contribution (Equation 6.75). For example, the 
ground state (n = 1, / = 0, j = 1/2, and therefore gj = 2) splits into two levels: 

- 13.6 eV(l + a 2 /4) ± (i B B ext , [6.7" 

with the plus sign for mj = 1/2, and minus for mj = —1/2. These energies are 
plotted (as functions of 5 ext ) in Figure 6.11. 

^Problem 6.19 Consider the (eight) n = 2 states, |2 / jmj). Find the energy o: 
each state, under weak-field Zeeman splitting, and construct a diagram like Figure 
6.11 to show how the energies evolve as B cn increases. Label each line clearly, anc 
indicate its slope. 


6.4.2 Strong-Field Zeeman Effect 

If 7? ext » B mt , the Zeeman effect dominates 13 ; with 7? ex t in the z direction, the “good' 
quantum numbers are now n, /, mi, and m s (but not j and mj because—in the presence 
of the external torque—the total angular momentum is not conserved, whereas L z and 


13 In this regime the Zeeman effect is also known as the Paschen-Back effect. 



Sec. 6.4 The Zeeman Effect 247 


S z are). The Zeeman Hamiltonian is 

H' z = ~B m (L z + 2S z ), 

2m 

and the “unperturbed” energies are 

13.6 eV 

Enmim, — ^ h dB / + 2 m s ). [6.78] 

n L 

In first-order perturbation theory, the fine-structure correction to these levels is 

E l k = ( nlmim s \(H’ r + H r so )\nlmim s ). [6.79] 

The relativistic contribution is the same as before (Equation 6.56); for the spin-orbit 
term (Equation 6.60) we need 

<S ■ L} = (S X )(L X ) + (S y ){L y ) + (S Z )(L Z ) = h 2 mim s [6.80] 


(note that { S x ) = {S y ) = (L x ) = (L y ) = 0 for eigenstates of S z and L z ). Putting all 
this together (Problem 6.20), we conclude that 


13.6 eV 2 [ 3 f /(/+l)-ffl/m s l) 
fs »3 “ |4 n [/(/ + 1/2)(/ + \)\\ 


[6.81] 


(The term in square brackets is indeterminate for 1 = 0; its correct value in this case 
is 1—see Problem 6.22.) The total energy is the sum of the Zeeman part (Equation 
6.78) and the fine-structure contribution (Equation 6.81). 


Problem 6.20 Starting with Equation 6.79 and using Equations 6.56, 6.60, 6.63, 
and 6.80, derive Equation 6.81. 


**Problem 6.21 Consider the eight n = 2 states, |2 lmim s ). Find the energy of 
each state, under strong-field Zeeman splitting. (Express your answers as the sum of 
three terms, as in Equation 6.77: the Bohr energy; the fine structure, proportional to 
a 2 ; and the Zeeman contribution, proportional to {x B 5 ext .) If you ignore fine structure 
altogether, how many distinct levels are there, and what are their degeneracies? 


Problem 6.22 If / = 0, then j = 5, mj = m s , and the “good” states are the 
same (| nm s )) for weak and strong fields. Determine E ] z (from Equation 6.71) and 
the fine structure energies (Equation 6.66), and write down the general result for the 
1 = 0 Zeeman effect —regardless of the strength of the field. Show that the strong- 
field formula (Equation 6.81) reproduces this result, provided that we interpret the 
indeterminate term in square brackets as 1. 
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6.4.3 Intermediate-Field Zeeman Effect 


In the intermediate regime, neither H z nor H^ s dominates, and we must treat the two 
on an equal footing, as perturbations to the Bohr Hamiltonian (Equation 6.41): 

H' = H f z + Hi. [6.82] 


I’ll confine my attention here to the case n = 2 and choose as the basis for de¬ 
generate perturbation theory the states characterized by /, j, and m r ' 4 Using the 
Clebsch-Gordan coefficients (Problem 4.45 or Table 4.7) to express | j mj) as a linear 
combination of |/ mi) | j m s ), we have 


ti = iH) = 

fi = |l^l) = |00>|if), 


fl = \\ l> 
t* = I|t> 
ft = W\) 
ft, = \\ )} 
fi = Iit) 
= Iit> 


H -1)|JT>. 
7573|i0)|ii> 
-yi73|io)|ii) 
717311-i)iH> 

-^73|i-i)|ii) 


+ VT73|ii)l^>. 
+ V273|ii)l^>: 
+ 7273|io)|i^> ; 
+ VV3|10)|i^>. 


In this basis the nonzero matrix elements of /f/ s are all on the diagonal, and 
given by Equation 6.65; H z has four off-diagonal elements, and the complete matrix 
— W is (see Problem 6.23) 


/5 y-P 0 0 

0 5y + P 0 

0 0 y- 2@ 

0 0 0 

0 0 0 

0 0 0 

0 0 0 

\0 0 0 


0 0 0 

0 0 0 

0 0 0 

Y + 2fi 0 0 

0 y- id f/? 

0 5y-\p 

0 0 0 

0 0 0 


where 


y = (a/8) 2 13.6 eV and ft = /z s 5 e xt- 


0 0 , 

0 0 

0 0 

0 0 

0 0 

0 0 

Y + \P 

=rP 5r + |/J/ 


The first four eigenvalues are displayed along the diagonal; it remains only to find the 
eigenvalues of the two 2x2 blocks. The characteristic equation for the first is 


k +k(6y—fi) + (5y — —yfi)— 0, 


14 You can use l,mi,m s states if you prefer—this makes the matrix elements of H’ z easier but those 
of H' k more difficult; the W -matrix will be more complicated, but its eigenvalues (which are independent 
of basis) are the same either way. 
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Table 6.2: Energy levels for the n = 2 states of hydrogen, with fine structure and 
Zeeman splitting. 


ei = E 2 -5y+p 

e 2 = E 2 - 5y - f> 

€ 3 = E 2 -y + 2f 

€4 = E 2 - y - 2f 

e 5 = E 2 - 3y + p/2 + y j 4y 2 + (2/3)yj6 + p 2 /4 

e 6 = E 2 - 3y + p/2 ~ ^4y 2 + (2/3)yP + p 2 /4 

67 = E 2 -3y- p/2 + y V ~ (2/3)yP + P 2 /4 

eg = E 2 -3y- p/2 - y/4y 2 — (2/3)yP + p 2 /4 


and the quadratic formula gives the eigenvalues: 

= -3/ + W2) ± V47 2 + (2/3)r/3 + (f) 2 / 4). [6.83] 

The eigenvalues of the second block are the same, but with the sign of fi reversed. 
The eight energies are listed in Table 6.2, and plotted against B sxl in Figure 6.12. In 
the zero-field limit (fi = 0) they reduce to the fine-structure values; for weak fields 
(/? ^ y) they reproduce what you got in Problem 6.19; for strong fields (J3 » y) 
we recover the results of Problem 6.21 (note the convergence to five distinct energy 
levels, at very high fields, as predicted in Problem 6.21). 

Problem 6.23 Work out the matrix elements of H' z and Hf, and construct the 
Vk-matrix given in the text, for n = 2. 

***Problem 6.24 Analyze the Zeeman effect for the n = 3 states of hydrogen in the 
weak, strong, and intermediate field regimes. Construct a table of energies (analogous 



Figure 6.12: Zeeman splitting of the n = 2 
states of hydrogen in the weak, 
intermediate, and strong field regimes. 



250 Chap. 6 Time-Independent Perturbation Theory 


to Table 6.2), plot them as functions of the external field (as in Figure 6.12), and check 
that the intermediate-field results reduce properly in the two limiting cases. 


6.5 HYPERFINE SPLITTING _ 

The proton itself constitutes a magnetic dipole, though its dipole moment is much 
smaller than the electron’s because of the mass in the denominator (Equation 6.59): 



(The proton is a composite structure, made up of three quarks, and its gyromagnetic 
ratio is not as simple as the electron’s—hence the g-factor, 15 whose measured value 
is 5.59 as opposed to 2.00 for the electron.) According to classical electrodynamics, 
a dipole fi sets up a magnetic field 16 

B = -^ 1 [3(jl-r)f-n] + ^u.S 3 (T). [6.851 

4jrr 3 3 

So the Hamiltonian (Equation 6.57) of the electron, in the magnetic field due to the 
proton’s magnetic dipole moment, is 

HU = ^ + ***.*, . S^(r). [6.861 

8 7T rn p fn e r 3 3 m p m e 

According to perturbation theory, the first-order correction to the energy (Equa¬ 
tion 6.9) is the expectation value of the perturbing Hamiltonian: 


£ \ doge 2 3 (S p • r)(S e • r) - S p • S e 
hf 8 7tm p m e r 3 

+ ^ogfl ( S Se )|^(0)| 2 . [6.87] 

3 m p m e 

In the ground state (or any other state for which / = 0) the wave function is spherically 
symmetrical, and the first expectation value vanishes (see Problem 6.25). Meanwhile, 
from Equation 4.80 we find that IVoooCO)! 2 = 1/(7 ra 3 ), so 


El = 


doge 2 

3itm p m e a 3 


(S p -Se), 


[ 6 . 88 ] 


15 The Lande g-factor, in Equation 6.74, plays a similar role in the proportionality between the 
electron’s total magnetic moment (flj + fi s ) and its total angular momentum J, 

16 If you are unfamiliar with the delta function term in Equation 6.85, you can derive it by treating 
the dipole as a spinning charged spherical shell, in the limit as the radius goes to zero and the charge goes 
to infinity (with /i held constant). See D. J. Griffiths, Am. J. Phys. 50, 698 (1982). 
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Figure 6.13: Hyperfine splitting in the ground state of hydrogen. 

in the ground state. This is called spin-spin coupling because it involves the dot 
product of two spins (contrast spin-orbit coupling, which involves S • L). 

In the presence of spin-spin coupling, the individual spin angular momenta are 
no longer conserved; the “good” states are eigenvectors of the total spin, 

S = S e + S p . [6.89] 

As before, we square this out to get 

VS e=\(S l -Sl-S 1 p ). [6.90] 

But the electron and proton both have spin 1/2, so S 2 = S 2 p — (3/4)// 2 . In the triplet 
state (spins “parallel”) the total spin is 1, and hence S 2 = 271 2 ; in the singlet state the 
total spin is 0, and S 2 = 0. Thus 


4g^ 4 +1 /4, (triplet); 

3 m p mlc 2 a A —3/4, (singlet). 


[6.91] 


Spin-spin coupling breaks the spin degeneracy of the ground state, lifting the 
triplet configuration and depressing the singlet (see Figure 6.13). 

The energy gap is evidently 


Apt, 4 

A E = -- . , = 5.88 x 10~ 6 eV. [6.92] 

3 m p m l e c l a 4 

The frequency of the photon emitted in a transition from the triplet to the singlet state 
is 

v = — = 1420 MHz, [6.93] 

h 

and the corresponding wavelength is c/v = 21 cm, which falls in the microwave 
region. This famous “21 -centimeter line” is among the most pervasive and ubiquitous 
forms of radiation in the universe. 
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Problem 6.25 Let a and b be two constant vectors. Show that 
f 4n 

I (a • r)(b • r) sin 6 d6 d<f> = — (a- b). [6.94 

The integration is over the usual range: 0<9<n,Q<<f)<27T. Use this result to 
demonstrate that 


3(S p -f)(S e -r)-S f> -S e ) _ Q 

r 3 

for states with / = 0. Hint: f = sin 9 cos (f>i + sin 9 sin <p j + cos 9k. 


Problem 6.26 By appropriate modification of the hydrogen formula, determine the 
hyperfine splitting in the ground state of (a) muonic hydrogen (in which a muon— 
same charge and g-factor as the electron, but 207 times the mass—substitutes for 
the electron), (b) positronium (in which a positron—same mass and g-factor as the 
electron, but opposite charge—substitutes for the proton), and (c) muonium (in which 
an antimuon—same mass and g-factor as a muon, but opposite charge—substitutes for 
the proton). Hint: Don’t forget to use the reduced mass (Problem 5.1) in calculating 
the “Bohr radius” of these exotic “atoms.”. Incidentally, the answer you get for 
positronium (4.85 x 10 -4 eV) is quite far from the experimental value (8.41 x 10“' 
eV); the large discrepancy is due to pair annihilation (e + + e~ -* y + y), which 
contributes an extra (3/4)A£ and does not occur (of course) in ordinary hydrogen, 
muonic hydrogen, or muonium. See Griffiths (footnote 16) for further details. 


FURTHER PROBLEMS FOR CHAPTER 6 _ 

**Problem 6.27 Suppose the Hamiltonian H, for a particular quantum system, 
is a function of some parameter A.; let E n (X) and ir n (X) be the eigenvalues and 
eigenfunctions of H(X). The Feynman-Hellmann theorem states that 

^ = [6.95] 

(assuming either that E n is nondegenerate, or—if degenerate—that the ir n ’s are the 
“good” linear combinations of the degenerate eigenfunctions). 

(a) Prove the Feynman-Hellmann theorem. Hint: Use Equation 6.9. 

(b) Apply it to the one-dimensional harmonic oscillator, (i) using X = to (this yields 
a formula for the expectation value of V), (ii) using X—h (this yields (7 1 )), and 
(iii) using X = m (this yields a relation between (T) and {V)). Compare your 
answers to Problem 2.37 and the virial theorem predictions (Problem 3.53). 




Further Problems for Chapter 6 253 


**Problem 6.28 The Feynman-Hellmann theorem (Problem 6.27) can be used to 
determine the expectation values of 1/r and 1 jr 1 for hydrogen. 17 The effective 
Hamiltonian for the radial wave functions is (Equation 4.53) 

H _ ft 2 d 2 ti 2 /(/ +1) e 2 1 

2m dr 2 2m r 2 4Jte 0 r ’ 

and the eigenvalues (expressed in terms of /) 18 are (Equation 4.70) 


32jr 2 ^7j 2 0' max + / + l) 2 

(a) Use X = e in the Feynman-Hellmann theorem to obtain (1/r). Check your 
result against Equation 6.54. 

(b) Use X — l to obtain (1/r 2 ). Check your answer with Equation 6.55. 


***Problem 6.29 Prove Kramers’ relation: 

^rV) - (2s + l)tf (r*- 1 ) + -U21 + l) 2 - S 2 ]a 2 (r s - 2 ) = 0, [6.96] 

n l 4 

which relates the expectation values of r to three different powers (5, 5 — 1, and 
s — 2), for an electron in the state ijr n / m of hydrogen. Hint: Rewrite the radial 
equation (Equation 4.53) in the form 


u 


' /(/ + 1) 
r 2 


2 1 1 

— + ~Y"2 M. 

ar n l a l J 


and use it to express j(ur s u ,! )dr intermsof (r s ), (r s_1 ),and ( r s ~ 2 ). Then use integra¬ 
tion by parts to reduce the second derivative. Show that j(ur s u')dr = —(s/2)(r s ~ l ), 
and f ( u'r s u')dr = —[2 f(s + 1)] f (u”r s+l u')dr. Take it from there. 

Problem 6.30 


(a) Plug 5=0, 5 = 1,5 = 2, and 5 = 3 into Kramers’ relation (Equation 6.96) 
to obtain formulas for (r _1 ), (r), (r 2 ), and (r 3 ). Note that you could continue 
indefinitely, to find any positive power. 

(b) In the other direction, however, you hit a snag. Put in 5 = —1, and show that 
all you get is a relation between (r~ 2 ) and ( r ~ 3 ). 


17 C. Sanchez del Rio, Am. J. Phys., 50, 556 (1982); H. S. Valk, Am. J. Phys ., 54, 921 (1986). 

18 In part (b) we treat l as a continuous variable; n becomes a function of l, according to Equation 
4.67, because / Inax , which must be an integer, is fixed. To avoid confusion, I have eliminated n, to reveal 
the dependence on l explicitly. 
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(c) But if you can get { r~ 2 ) by some other means, you can apply the Kramers' 
relation to obtain the rest of the negative powers. Use Equation 6.55 (which 
is derived in Problem 6.28) to determine (r -3 ), and check your answer against 
Equation 6.63. 

***Problem 6.31 When an atom is placed in a uniform external electric field E ex t, the 
energy levels are shifted—a phenomenon known as the Stark effect. In this problem 
we analyze the Stark effect for the n = 1 and n —2 states of hydrogen. Let the field 
point in the z direction, so the potential energy of the electron is 

H' s = — eE ext z = —eE ext r cos 9. 

Treat this as a perturbation on the Bohr Hamiltonian (Equation 6.41); spin is irrelevant 
to this problem, so ignore it. 

(a) Show that the ground-state energy is not affected by this perturbation, in first 
order. 

(b) The first excited state is fourfold degenerate: ^200, ^211, ^210, ^21-1- Using de¬ 
generate perturbation theory, determine the first-order corrections to the energy. 
Into how many levels does Ej split? 

(c) What are the “good” wave functions for part (b)? Find the expectation value of 
the electric dipole moment (p e = — er), in each of these “good” states. Notice 
that the results are independent of the applied field—evidently hydrogen in its 
first excited state can carry a permanent electric dipole moment. 

Hint : There are a lot of integrals in this problem, but almost all of them are 
zero. So study each one carefully before you do any calculations: If the 0 integral 
vanishes, there’s not much point in doing the r and 9 integrals! Partial answer 
W \3 = W 31 = 3eaE ex t ; all other elements are zero. 

***Problem 6.32 Consider the Stark effect (Problem 6.31) for the n — 3 states of 
hydrogen. There are initially nine degenerate states, 03/ m (neglecting spin, of course), 
and we turn on an electric field in the z direction. 

(a) Construct the 9 x 9 matrix representing the perturbing Hamiltonian. Partial an¬ 
swer: (300|z|3 1 0) = -3vfefl, <3 1 0jz|320) = -3^3a, (3 1 ±1 |z[32±1) = 
-(9/2 )a. 

(b) Find the eigenvalues and their degeneracies. 

Problem 6.33 Calculate the wavelength, in centimeters, of the photon emitted 
under a hyperfine transition in the ground state (n = 1) of deuterium. Deuterium 
is “heavy” hydrogen, with an extra neutron in the nucleus. The proton and neutron 
bind together to form a deuteron, with spin 1 and magnetic moment 
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Figure 6.14: Hydrogen atom surrounded by six point charges (crude model for 
a crystal lattice); Problem 6.34. 


the deuteron g-factor is 1.71. 


i/ - gd6 <i . 


***Problem 6.34 In a crystal, the electric field of neighboring ions perturbs the energy 
levels of an atom. As a crude model, imagine that a hydrogen atom is surrounded by 
three pairs of point charges, as shown in Figure 6.14. 

(Spin is irrelevant to this problem, so ignore it.) 


(a) Assuming that r <^d\,r d 2 , and r <£ ds, show that 

H'=V„+ 3(frx 2 + fry 2 + frz 2 ) - (fr +fr + fr)r 2 , 

where 

Pi = ~ ^ JT€ ^ 3 ’ anC * V 0 = 2(P\df + P 2 d\ + # 3^3 )• 

(b) Find the lowest-order correction to the ground-state energy. 

(c) Calculate the first-order corrections to the energy of the first excited states (n = 
2). Into how many levels does this fourfold degenerate system split, (i) in the 
case of cubic symmetry, Pi — p 2 — Py, (ii) in the case of orthorhombic 
symmetry, Pi = p 2 # P 3 ; (iii) in the general case of tetragonal symmetry 
(all three different)? 



CHAPTER 7 


THE VARIATIONAL PRINCIPLE 


7.1 THEORY _ 

Suppose you want to calculate the ground-state energy E g for a system described b\ 
the Hamiltonian H, but you are unable to solve the (time-independent) Schrodinger 
equation. Pick any normalized Junction Jr whatsoever. 

Theorem: 

[7.1] 

That is, the expectation value of H in the (presumably incorrect) state Jr is certain to 
overestimate the ground-state energy. Of course, if Jr just happens to be one of the 
excited states, then obviously (H) exceeds E g \ but the theorem says that the same 
holds for any Jr whatsoever. 

Proof: Since the (unknown) eigenfunctions of H form a complete set, we can 
express Jr as a linear combination of them 1 : 

Jr = ^ c n Jr n , with HJr n — E n Jr n . 

1 If the Hamiltonian admits scattering states, as well as bound states, then we’ll need an integral as 
well as a sum, but the argument is unchanged. 
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Since \{r is normalized, 

1 = {f\f) = (J2 C mfm\^2c n f n ) = J2Y^ C * m C n (ir m \ifn) = 

(assuming the eigenfunctions have been orthonormalized: {f m \ir n ) = 8 mn ). Mean¬ 
while, 

(H) = (Y^cMH^cM =Y^clE„cAirMn) = 

But the ground-state energy is, by definition, the smallest eigenvalue, so E g < E n , 
and hence 

(H) > E^Cnl 2 = E s . 


QED 


Example 1. Suppose we want to find the ground-state energy for the one¬ 
dimensional harmonic oscillator: 


r d 2 1 , 2 

H= -^ + r ax 


Of course, we already know the exact answer, in this case (Equation 2.49): 
E g — (1/2but this makes it a good test of the method. We might pick as 
our “trial” wave function the gaussian. 


f (x) = Ae- bx \ 

where b is a constant and A is determined by normalization: 

Now 

(H) = (T) + (V), 

where, in this case, 


( T) 


2m J_ 0O dx 2 V / 2m 


and 


so 


1 oof 00 1 , moy 

(V) = ~mco 2 \A\ 2 J e 2b x 2 dx = 


8 b ' 


frb mar 

{H) =2m + ltt- 


[7.2] 


[7.3] 

[7.4] 


[7.5] 


[7.6] 
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According to the theorem, this exceeds E g for any b\ to get the tightest bound 
let’s minimize ( H) with respect to b : 


d n mto 

db^ ^ 2m 8 b 2 


b m ° J 


Putting this back into (H), we find 


{ 71) min = -lUO. 


[77 


In this case we hit the ground-state energy right on the nose—because (obviously \ I 
“just happened” to pick a trial function with precisely the form of the actual ground 
state (Equation 2.48). But the gaussian is very easy to work with, so it’s a popular 
trial function even when it bears little resemblance to the true ground state. 


Example 2. 

function potential: 


Suppose we’re looking for the ground state energy of the delta- 


h 2 d 2 


Again, we already know the exact answer (Equation 2.109): E g = —ma 2 /2h 2 . A' 
before, we’ll use a gaussian trial function (Equation 7.2). We’ve already determined 
the normalization and calculated ( T ); all we need is 


{V) = -a\A\ 2 


/: 


r 2lal S(x)dx = -a 

V 71 


Evidently, 



and we know that this exceeds E g for all b. Minimizing it, 


d h 2 a 2 m 2 a 2 

— (H) = ---= = 0 ^ b= - r - 

db 2m «/2nb izh A 


So 


(77) min — 


ma 2 
7th 2 ’ 


which is indeed somewhat higher than E g , since 7r > 2. 


[7.8 


[7.9] 


I said you can use any (normalized) trial function t/r whatsoever, and this is 
true in a sense. However, for discontinuous functions it takes some fancy footwork 
to assign a sensible interpretation to the second derivative (which you need, in order 
to calculate (T)). Continuous functions with kinks in them are fair game, however: 
the next example shows how to handle them. 
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Example 3. Find an upper bound on the ground-state energy of the one¬ 
dimensional infinite square well (Equation 2.15), using the “triangular” trial wave 
function (Figure 7.1) 2 : 


f{x) = 


Ax, if 0 < < a/2, 

A (a - x), if a/2 <x<a, 
0 , otherwise, 


where A is determined by normalization: 


[7.10] 


1 = \A\ 2 x 2 dx + j (a - x) 1 dx J = 


W T2 


a- 2 -A 

ay a 


[7.11] 


In this case 


dty 

dx 


A, if 0 < x < a/2, 
-A, if<a/2 <x<a, 
0 , otherwise. 


as indicated in Figure 7.2. Now, the derivative of a step function is a delta function 
(see Problem 2.24b): 


d 2 f 

Tx 2 “ 


= ;4«$(je) - 2 AS(x - a/2) + A8(x - a). 


[7.12] 


and hence 


fi 2 A C 

( H) = -/ [<5(*) - 2 S(x - a/2) + <5(x - a)]ir{x) dx 

2m J 


j/ 2 A ft 2 A 2 a 

= ~ “ 2 ^(«/ 2 ) + ^(a)] = 2m . : 


m 2 

' 2ma 2 ' 


[7.13] 



Figure 7.1: Triangular trial wave function for the infinite square well (Equation 7.10). 


2 There is no point in trying a function (such as the gaussian) that extends outside the well, because 
you’ll get (J 7 ) = oo, and Equation 7.1 tells you nothing. 
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Figure 7.2: Derivative of the wave function in Figure 7.1. 

The exact ground state is E g — n 2 Ti 2 jlma 2 (Equation 2.23), so the theorem works 
(12 > 7T 2 ). 


The variational principle (as Equation 7.1 is called) is extremely powerful, 
and embarrassingly easy to use. What a chemist does, to find the ground-state energ\ 
of some complicated molecule, is write down a trial wave function with a large number 
of adjustable parameters, calculate (H), and tweak the parameters to get the lowest 
possible value. Even if t/t has no relation to the true wave function, one often gets 
miraculously accurate values for E g . Naturally, if you have some way of guessing a 
realistic \{r, so much the better. The only trouble with the method is that you never 
know for sure how close you are to the target—all you can be certain of is that you’ve 
got an upper bound . Moreover, the technique applies only to the ground state (see. 
however, Problem 7.4). 

^Problem 7.1 Use the gaussian trial function (Equation 7.2) to obtain the low¬ 
est upper bound you can on the ground-state energy of (a) the linear potential: 
V(x) — a|x|; (b) the quartic potential: V (jc) = ax 4 . 


**Problem 7.2 Find the best bound on E g for the one-dimensional harmonic oscil¬ 
lator using a trial wave function of the form 


xjr(x) = 


A 

x 2 + b 2 ’ 


where A is determined by normalization and b is an adjustable parameter. 


Problem 7.3 Find the best bound on E g for the delta-function potential —a8(x - 
a/2), using the triangle trail function (Equation 7.10). (This time a is an adjustable 
parameter.) 
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Problem 7.4 

(a) Prove the following corollary to the variational principle: If (f\f g ) = 0, then 
(H) > Ef, where Ef is the energy of the first excited state. 

Thus, if we can find a trial function that is orthogonal to the exact ground state, we 
can get an upper bound on the first excited state. In general, it’s difficult to be sure 
that i (r is orthogonal to \jr g , since (presumably) we don’t know the latter. However, if 
the potential V (x) is an even function of x, then the ground state is likewise even, and 
hence any odd trial function will automatically meet the condition for the corollary. 

(b) Find the best bound on the first excited state of the one-dimensional harmonic 
oscillator using the trial function 

f(x) = Axe~ bx2 . 

Problem 7.5 

(a) Use the variational principle to prove that first-order nondegenerate perturbation 
theory always overestimates (or at any rate never underestimates) the ground- 
state energy. 

(b) In view of (a), you would expect that the second -order correction to the ground 
state is always negative. Confirm that this is indeed the case, by examining 
Equation 6.14. 


7.2 THE GROUND STATE OF HELIUM _ 

The helium atom (Figure 7.3) consists of two electrons in orbit around a nucleus 
containing two protons (also some neutrons, which are irrelevant to our purpose). 
The Hamiltonian for this system (ignoring fine structure and smaller corrections) is 


= -£( v ? + vl) 


+ 1 \ 

47 T 60 \n r 2 | ri — r 2 1 / 


[7.14] 


Our problem is to calculate the ground-state energy, E g —the amount of energy it 
would take to strip off the two electrons. (Given E g it is easy to figure out the 
“ionization energy” required to remove a single electron—see Problem 7.6.) E g has 
been measured very accurately in the laboratory: 


E g = —78.975 eV (experimental). 
This is the number we would like to reproduce theoretically. 


[7.15] 
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Figure 7.3: The helium atom. 


It is curious that such a simple and important problem has no known exact 
solution. 3 The trouble comes from the electron-electron repulsion, 


V e 


e 2 1 

47 reo In - r 2 | 


[7.16] 


If we ignore this term altogether, H splits into two independent hydrogen Hamilto¬ 
nians (only with a nuclear charge of 2e, instead of *>); the exact solution is just the 
product of hydrogenic wave functions: 


iAo(ri, r 2 ) = tAiooCrOVrooto) = —+'*>/«, [7.17] 

na 5 

and the energy is 8£] = -109 eV (Eq. [5.31]). 4 This is a long way from -79 eV, 
but it’s a start. 

To get a better approximation for E g , we’ll apply the variational principle, using 
x//o as the trial wave function. This is a particularly convenient choice because it’s an 
eigenfunction of most of the Hamiltonian: 



Hxh = (8£, + V ee )f Q . 

[7-18] 

Thus 

{H) = 8£j + (V ee ), 

[7.19] 

where 5 

/ e 2 \ / 8 \ 2 f e ~4(n+r 2 )/a 

[7.20] 


3 There do exist exactly soluble three-body problems with many of the qualitative features of helium, 
but using non-Coulombic potentials (see Problem 7.15). 

4 Here a is the ordinary Bohr radius and E n = -13.6/n 2 eV is the «th Bohr energy; recall that 
for a nucleus with atomic number Z, E„ Z 2 E„ and a -» a/Z (Problem 4.17). The spin configuration 
associated with Equation 7.17 will be antisymmetric (the singlet). 

5 You can, if you like, interpret Equation 7.19 as first-order perturbation theory, with V ee as II'. 
However, I regard this as a misuse of the method, since the perturbation is roughly equal in size to the 
unperturbed potential. I prefer, therefore, to think of it as a variational calculation, in which we are looking 
for an upper bound on E g . 
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x 2 

Figure 7.4: Choice of coordinates for the r 2 integral (Equation 7.20). 


I’ll do the T 2 integral first; for this purpose iq is fixed, and we may as well orient the 
r 2 coordinate system so that the polar axis lies along ri (see Figure 7.4). By the law 
of cosines, 

[ri — r 2 | = Jrl + r\ — 2rir 2 cos^ 2) [7.21] 


f e - Ari ! a , f 

h = J iiw^i dr2 = J T r 


} sin Oi dr 2 d02d(j)2- [7.22] 


Y r f + r 2 ~~ 2r l r 2 C0S ^2 


The 02 integral is trivial (2tt); the 0 2 integral is 


\ r i + r 2 ~ 2r l r 2 cos0 2 


y r\ + r% — 2 r\r 2 cos 9 2 


l — (Jr* + rf +2rjr 2 - y/ r\ + r\ - 2rjr 2 j 

■w (2 jH e-^ la r\ dr 2 + £ 


[7.24] 




Chap. 7 The Variational Principle 


It follows that (V ee ) is equal to 

(i) (i) /1 1 - (' + t) e " 4r,/ “] si "*' dr ' M ' d *'- 

The angular integrals are easy (4 n), and the n integral becomes 

Ll re ~ ir/ “-{ r+2 ^y r/ °] dr = m 

Finally, then, 

{Vee) = ^ ) = - 5 -E x = 34 eV, [7.25 

4 a \47re 0 / 2 

and therefore 

{H) = -109 eV + 34 eV = -75 eV. [7.26; 

Not bad (remember, the experimental value is —79 eV). But we can do better. 

Can we think of a more realistic trial function than \ /r 0 (which treats the tv. o 
electrons as though they did not interact at all)? Rather than completely ignoring 
the influence of the other electron, let us say that, on the average, each electron 
represents a cloud of negative charge which partially shields the nucleus, so that the 
other electron actually sees an effective nuclear charge (Z) that is somewhat less than 2. 
This suggests that we use a trial function of the form 


t/q(ri, r 2 ) = — e -^+n)!a_ [ 7 . 27 ] 

7i a 5 

We’ll treat Z as a variational parameter, picking the value that minimizes (H). 

This wave function is an eigenstate of the “unperturbed” Hamiltonian (neglect¬ 
ing electron repulsion), but with Z, instead of 2, in the Coulomb terms. With this in 
mind, we rewrite H (Equation 7.14) as follows: 




v- 


-( 

<?o V 


r 
4 ttcq 
(Z-2) ( (Z — 2) 


(K) 


47T€ 0 V n 
The expectation value of H is evidently 


n 


Ii*i - 


b) 


(H) = 2Z 2 E i + 2(Z - 2) ) (-) + (Vee). 

\47T€qJ r 


>(±V- 


[7.28] 


[7.29] 


Here (\/r) is the expectation value of 1/r in the (one-particle) hydrogenic ground 
state \J /\oo (but with nuclear charge Z); according to Equation 6.54, 
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The expectation value of V ee is the same as before (Equation 7.25), except that instead 
of Z = 2 we now want arbitrary Z —so we multiply a by 2/Z: 


5Z 




[7.31] 


Putting all this together, we find 

(H) = [2Z 2 - 4Z(Z - 2) - (5/4)Z] E r = [-2Z 2 + (27/4)Z]£,. [7.32] 

According to the variational principle, this quantity exceeds E g for any value 
of Z. The lowest upper bound occurs when (H) is minimized: 

A(i/) = [_4z + (27/4)]£, =0, 

dZ 

from which it follows that 

Z = 77 = L69 ' [733] 

16 

This is a reasonable result; it tells us that the other electron partially screens the 
nucleus, reducing its effective charge from 2 down to 1.69. Putting in this value for 
Z, we find 

W-i(l) *~77.S.V. [7.34] 

The ground state of helium has been calculated with great precision in this way, 
using increasingly complicated trial wave functions with more and more adjustable 
parameters. 6 But we’re within 2% of the correct answer, and, frankly, at this point 
my own interest in the problem begins to fade. 

Problem 7.6 Using Eg = -79.0 eV for the ground-state energy of helium, cal¬ 
culate the ionization energy (the energy required to remove just one electron). Hint 
First calculate the ground-state energy of the helium ion, He + , with a single electron 
orbitting the nucleus; then subtract the two energies. 

^Problem 7.7 Apply the techniques of this Section to the H“ and Li + ions (each has 
two electrons, like helium, but nuclear charges Z = 1 and Z = 3, respectively). Find 
the effective (partially shielded) nuclear charge, and determine the best upper bound 
on E g , for each case. Note: In the case of H~ you should find that {H) > —13.6 eV, 
which would appear to indicate that there is no bound state at all, since it is energet¬ 
ically favorable for one electron to fly off, leaving behind a neutral hydrogen atom. 
This is not entirely surprising, since the electrons are less strongly attracted to the 
nucleus than they are in helium, and the electron repulsion tends to break the atom 
apart. Flowever, it turns out to be incorrect. With a more sophisticated trial wave 


>E. A. Hylleraas, Z. Phys. 65, 209 (1930); C. L. Pekeris, Phys. Rev. 115, 1216 (1959). 
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function (see Problem 7,16) it can be shown that E g < —13.6 eV, and hence tha: 
a bound state does exist. It’s only barely bound, however, and there are no excited 
bound states, 7 so H has no discrete spectrum (all transitions are to and from the 
continuum). As a result, it is difficult to study in the laboratory, although it exists tr, 
great abundance on the surface of the sun. 8 


7.3 THE HYDROGEN MOLECULE ION _ 

Another classic application of the variational principle is to the hydrogen molecule 
ion, Ht", consisting of a single electron in the Coulomb field of two protons (Figure 
7.5). We shall assume for the moment that the protons are fixed in position, a specified 
distance R apart, although one of the most interesting byproducts of the calculation 
is going to be the actual value of R. The Hamiltonian is 



where r\ and rj are the distances to the electron from the respective protons. As 
always, the strategy will be to guess a reasonable trial wave function, and invoke 
the variational principle to get a bound on the ground-state energy. (Actually, our 
main interest is in finding out whether this system bonds at all —that is, whether its 
energy is less than that of a neutral hydrogen atom plus a free proton. If our trial wave 
function indicates that there is a bound state, a better trial function can only make the 
bonding even stronger.) 

To construct the trial wave function, imagine that the ion is formed by taking a 
hydrogen atom in its ground state (Equation 4.80), 

lM r) = e~'l\ [7.36] 

and then bringing in a proton from far away and nailing it down a distance R away. 
If R is substantially greater than the Bohr radius, a, the electron’s wave function 



7 Robert N. Hill, J. Math. Phys. 18, 2316 (1977). 

8 For further discussion, see Hans A. Bethe and Edwin E. Salpeter, Quantum Mechanics ofOne- 
and Two-Electron Atoms (New York: Plenum 1977), Section 34. 
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probably isn’t changed very much. But we would like to treat the two protons on an 
equal footing, so that the electron has the same probability of being associated with 
either one. This suggests that we consider a trial function of the form 

f = A [f g {r\) + f g (r 2 )] - [7.37] 

(Quantum chemists call this the LCAO technique, because we are expressing the 
molecular wave function as a /inear combination of atomic orbitals.) 

Our first task is to normalize the trial function: 


1 = J\f\ 2 d 3 r = \A\ 2 [J\^ g (n)\ 2 d 3 r 

+ J \f g (r2)\ 2 d 3 r+2 j f g (ri)f g (r 2 ) d 3 r], [7.38] 


The first two integrals are 1 (since \jr g itself is normalized); the third is more tricky. 
Let 

[7.39] 


/ = ifg(ri)\tg(n)) = 7-3 j e^ r < +r!>/a d 3 r. 


Picking coordinates so that proton 1 is at the origin and proton 2 is on the z-axis at 
the point R (Figure 7.6), we have 


r\ —r and r 2 = yfr 1 + R 2 - IrRcosO, 


[7.40] 


and therefore 

/ = J- [ e -r/* e -^ + R2-2rR COS 0/a r 2 [7.41] 

JTO 3 J 



Figure 7.6: Coordinates for the calculation of / (Equation 7.39). 
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The (f> integral is trivial (2jr). To do the 6 integral, let 

y = y/r 2 + R 1 — IrR cos 9, so that d(y 2 ) = 2ydy — 2 rR sin 9 dO. 

Then 

f\-^^/ asin0de = ± f r+R e -y/Vydy 
Jo r R J\r-R\ 

= —— \e~ (r+Rya {r + R+a)~ e~ lr - Rl/a (\r - R\ + fl)l. 
rR J 

The r integral is now straightforward: 

I = ^R[.~ e ~ Rfa j R + a ) e ~ 2r/a r dr + e~ R/a J (R - r + a)r dr 

+ e R/a j (r — R + a)e~ lr/a r dr ]. 

Evaluating the integrals, we find (after some algebraic simplification), 


I = e~ R/a 


i+ - 


(++) 


[7.42] 


I is called an overlap integral; it measures the amount by which Jf g (r\) overlaps 
(notice that it goes to 1 as R —> 0, and to 0 as R —^ oo). In terms of I, the 
normalization factor (Equation 7.38) is 




1 


[7-43] 


that 


' 2(1 + 7)' 

Next we must calculate the expectation value of H in the trial state i (r. Noting 


( 


Ti 2 2 e 2 1 \ 

"2^ v -^7o7 i )^ (ri) = E '^ (ri) 


(where E\ — —13.6 eV is the ground-state energy of atomic hydrogen)—and the 
same with r 2 in place of r\ —we have 

~ A [~L v2 ~ 4 ^ + 7 )] [fg(r,) + fs(r2)] 

= Elf ~ A (ik) [^ s (n) + 4^ (r2 )]. 


It follows that 

(//} = £, -2M| 2 


(~r —) + {'h(n)\ — \4'gi.ri)) |. [7.44] 

\4jre 0 / L r 2 r\ ' 
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I’ll let you calculate the two remaining quantities, the so-called direct integral, 

£> = fl |^(r 1 )|2| 1 <r (t (r,)j, [7-45] 

and the exchange integral, 

X=a(^ g (r l )iTi rg (r 2 )j. [7.46] 

The results (see Problem 7.8) are 

D = i _ (' + i) e ~ 1Rla [1A1] 

and 

X= ^14- 

Putting all this together, and recalling (Equations 4.70 and 4.72) that 
E\ = —(e 2 /4neo)(l/2a), we conclude that 

w = [ 1+2 TOt] £ " vw 

According to the variational principle, the ground-state energy is less than {H). Of 
course, this is only the electron’s energy—there is also potential energy associated 
with the proton-proton repulsion: 


;■ 


[7.48] 


V BD = - 


2 a , 


4jreo R 


7 E ,. 


[7.50] 


Thus the total energy of the system, in units of —E\ and expressed as a function of 
x = R/a, is less than 


F(x) = -1 + - 


(1 — (2/3)x 2 )e~ x + (1 + x)e~ 2x 
1 + (1 + x + (l/3)x 2 )e~ x 


[7.51] 


This function is plotted in Figure 7.7. Evidently bonding does occur, for there exists 
a region in which the graph goes below — 1, indicating that the energy is less than that 
of a neutral atom plus a free proton (to wit, —13.6 eV). The equilibrium separation 
of the protons is about 2.4 Bohr radii, or 1.27 °A. 

^Problem 7.8 Evaluate D and X (Equations 7.45 and 7.46). Check your answers 
against Equations 7.47 and 7.48. 


**Problem 7.9 Suppose we used a minus sign in our trial wave function (Equa¬ 
tion 7.37): 


is = A[\fr g (ri) - f g (r 2 )l 


[ 7 . 52 ] 
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Figure 7.7: Plot of the function F(x), Equation 7.51, showing existence of a 
bound state. 


Without doing any new integrals, find F(x) (the analog to Equation 7.51) for this 
case, and construct the graph. Show that there is no evidence of bonding. (Since 
the variational principle only gives an upper bound , this doesn’t prove that bonding 
cannot occur for such a state, but it certainly doesn’t look promising). Note: Actually, 
any function of the form 


f = A[if g (ri) + e l<t> f g {r 2 )\ [7.53] 

has the desired property that the electron is equally likely to be associated with ei¬ 
ther proton. However, since the Hamiltonian (Equation 7.35) is invariant under the 
interchange P:r\ *>r 2 , its eigenfunctions can be chosen to be simultaneously eigen¬ 
functions of P. The plus sign (Equation 7.37) goes with the eigenvalue +1, and the 
minus sign (Equation 7.52) with the eigenvalue —1; nothing is to be gained by con¬ 
sidering the ostensibly more general case (Equation 7.53), though you’re welcome to 
try it, if you’re interested. 


***Problem 7.10 The second derivative of F(x), at the equilibrium point, can be used 
to estimate the natural frequency of vibration (co) of the two protons in the hydrogen 
molecule ion (see Section 2.3). If the ground-state energy of this oscillator 

exceeds the binding energy of the system, it will fly apart. Show that in fact the 
oscillator energy is small enough that this will not happen, and estimate how many 
bound vibrational levels there are. Note : You’re not going to be able to obtain the 
position of the minimum—still less the second derivative at that point—analytically. 
Do it numerically, on a computer. 
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FURTHER PROBLEMS FOR CHAPTER 7 _ 

Problem 7.11 Find the lowest bound on the ground state of hydrogen you can get 
using a gaussian trial wave function 

0(r) = Ae ~ brl , 

where A is determined by normalization and b is an adjustable parameter. Answer. 
— 11.5 eV. 

**Problem 7.12 If the photon had a nonzero mass (m y ^ 0), the Coulomb potential 
would be replaced by a Yukawa potential, of the form 


where p = m Y c/fi. With a trial wave function of your own devising, estimate the 
binding energy of a “hydrogen” atom with this potential. Assume pa <§C 1, and give 
your answer correct to order (pa) 1 . 


Problem 7.13 Suppose you’re given a quantum system whose Hamiltonian Hq 
admits just two eigenstates, \f/ a (with energy E a ), and (with energy E b ). They are 
orthogonal, normalized, and nondegenerate (assume E a is the smaller of the two). 
Now we turn on a perturbation H', with the following matrix elements: 

(ifa\H ! \f a ) = (f b \H f m = 0; (fAH'ltb) = iifb\H r \f a ) = h. [7.55] 


(a) Find the exact eigenvalues of the perturbed Hamiltonian. 

(b) Estimate the energies of the perturbed system using second-order perturbation 
theory. 

(c) Estimate the ground-state energy of the perturbed system using the variational 
principle, with a trial function of the form 

0 = (cos 4>)f a + (sin 4>)fb, [7.56] 

where 0 is an adjustable parameter. (Note that writing the linear combination 
in this way guarantees that 0 is normalized.) 

(d) Compare your answers to (a), (b), and (c). Why is the variational principle so 
accurate in this case? 


Problem 7.14 As an explicit example of the method developed in Problem 7.13, 
consider an electron at rest in a uniform magnetic field B = B z k, for which the 
Hamiltonian is (Equation 4.158): 


[7.57] 
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The eigenspinors, Xa and xt, and the corresponding energies, E a and Et, are given 
in Equation 4.161. Now we turn on a perturbation, in the form of a uniform field in 
the x direction: 


H' = 




[7.58! 


m 


(a) Find the matrix elements of H', and confirm that they have the structure of 
Equation 7.55. What is hi 

(b) Using your result in Problem 7.13(b), find the new ground-state energy, in 
second-order perturbation theory. 

(c) Using your result in Problem 7.13(c), find the variational principle bound on 
the ground-state energy. 


***Problem 7.15 Although the Schrodinger equation for helium itself cannot be 
solved exactly, there exist “helium like” systems that do admit exact solutions. A 
simple example 9 is “rubber-band helium”, in which the Coulomb forces are replaced 
by Hooke’s law forces: 


H = + v 2 ) + \mo> 2 (r\ + r\) - \ma> 2 |r, - r 2 | 2 . 

2m 2 4 


[7.59] 


(a) Show that the change of variables from iq, r 2 , to 

1 1 
u = — (ri+r 2 ), v=—(ri-r 2 ) 

V2 V2 


[7.60] 


turns the Hamiltonian into two independent three-dimensional harmonic oscil¬ 
lators: 


H = [ _ ^n V “ + \ mc ° 2u2 ~\ + [ _ ^ V ^ + t 7 - 61 ! 

(b) What is the exact ground-state energy for this system? 

(c) If we didn’t know the exact solution, we might be inclined to apply the method 
of Section 7.2 to the Hamiltonian in its original form (Equation 7.59). Do so 
(but don’t bother with shielding). How does your result compare with the exact 
answer? Answer: (H) =3Tia)(\ — A./4). 


***ProbIem 7.16 In Problem 7.7 we found that the trial wave function with shielding 
(Equation 7.27), which worked well for helium, is inadequate to confirm the existence 


9 For a more sophisticated model, see R. Crandall, R. Whitnell, and R. Bettega, Am. J. Phys. 52. 
438(1984). 
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of a bound state for the negative hydrogen ion. Chandrasekhar 10 used a trial wave 
function of the form 

\f(r u r 2 ) = A[yf\{r x )\jf 2 {r 2 ) + [7.62] 


where 


f\ (r) = 



~ z ' r / a , and xf/ 2 (r) = 



[7.63] 


In effect, he allowed two different shielding factors, suggesting that one electron 
is relatively close to the nucleus and the other is farther out. (Because electrons 
are identical particles, the spatial wave function must be symmetrized with respect 
to interchange. The spin state—which is irrelevant to the calculation—is evidently 
antisymmetric.) Show that by astute choice of the adjustable parameters Z\ and Z 2 
you can get ( H) less than —13.6 eV. Answer, 


(H) = -Zl— (-x 8 + 2x 7 + I* 6 / - -X s / - F 3 / + —jr/ - I/), 

' ' *<> + /( 2 7 2 7 8 7 8 7 2 * )' 


where x = Z x + Z 2 and y = 'l«JZ\Z 2 , Chandrasekhar used Z\ = 1.039 (since this 
is larger than 1, the motivating interpretation as an effective nuclear charge cannot 
be sustained, but never mind—it’s still an acceptable trial wave function) and Z 2 — 
0.283. 


Problem 7.17 The fundamental problem in harnessing nuclear fusion is getting the 
two particles (say, two deuterons) close enough together for the attractive (but short- 
range) nuclear force to overcome the Coulomb repulsion. The “brute force” method is 
to heat the particles to fantastic temperatures and allow the random collisions to bring 
them together. A more exotic proposal is muon catalysis, in which we construct a 
“hydrogen molecule ion”, only with deuterons in place of protons, and a muon in place 
of the electron. Predict the equilibrium separation distance between the deuterons in 
such a structure, and explain why muons are superior to electrons for this purpose. 11 


10 S. Chandrasekhar, Astrophys. J. 100, 176 (1944). 

11 The classic paper on muon-catalyzed fusion is J. D. Jackson, Phys. Rev. 106, 330 (1957); for a 
recent popular review, see J. Rafelski and S. Jones, Scientific American, November 1987, page 84. 



CHAPTER 8 


THE WKB 
APPROXIMATION 


The WKB (Wentzel, Kramers, Brillouin) 1 method is a technique for obtaining ap¬ 
proximate solutions to the time-independent Schrodinger equation in one dimension 
(the same basic idea can be applied to many other differential equations, and to the 
radial part of the Schrodinger equation in three dimensions). It is particularly useful 
in calculating bound-state energies and tunneling rates through potential barriers. 

The essential idea is as follows: Imagine a particle of energy E moving through 
a region where the potential V (x) is constant. If E > V, the wave function is of the 
form 


\j/(x) = Ae ±lkx , with k = yflm(E — V)fh. 

The plus sign indicates that the particle is traveling to the right, and the minus sign 
means it is going to the left (the general solution, of course, is a linear combination 
of the two). The wave function is oscillatory, with constant wavelength X = 2n(k 
and constant amplitude A. Now suppose that V(x) is not constant, but varies rather 
slowly in comparison to X, so that over a region containing many full wavelengths 
the potential is essentially constant. Then it is reasonable to suppose that f remains 
practically sinusoidal, except that the wavelength and the amplitude change slowly 
with x. This is the inspiration behind the WKB approximation. In effect, it identi¬ 
fies two different levels of x-dependence: rapid oscillations, modulated by gradual 
variation in amplitude and wavelength. 


'in Holland it’s KWB, in France it’s BWK, and in England it’s JWKB {for Jeffreys). 
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By the same token, if E < V (and V is constant), then i p is exponential: 

\p(x) = Ae ±KX , with k = yj2m(V — E)/Ti. 

And if V (x) is not constant, but varies slowly in comparison with 1 /k, the solution 
remains practically exponential, except that A and k are now slowly varying functions 
of x. 

Now, there is one place where this whole program is bound to fail, and that is 
in the immediate vicinity of a classical turning point, where E ^ V. For here A (or 
\/k) goes to infinity, and V (x) can hardly be said to vary “slowly” in comparison. 
As we shall see, a proper handling of the turning points is the most difficult aspect 
of the WKB approximation, though the final results are simple to state and easy to 
implement. 


8.1 THE “CLASSICAL” REGION 


The Schrodinger equation, 


Ti 2 d 2 \p 


can be rewritten in the following way: 


where 


d 2 xp 

dx 2 



[ 8 . 1 ] 


P(x) = m[E~V(x)] [8.2] 

is the classical formula for the momentum of a particle with total energy E and 
potential energy V (x). For the moment, I’ll assume that E > V (x), so that p(x) is 
real, we call this the “classical” region, for obvious reasons—classically the particle 
is confined to this range of x (see Figure 8.1). In general, ip is some complex function; 
we can express it in terms of its amplitude, A (x), and its phase, 0 (x)—both of which 
are real: 


\p(x) = A(x)e i(pix) . [8.3] 

Using a prime to denote the derivative with respect to x, we find 

ip = (A’ + iA4>')e 1 *, 


and 
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Figure 8.1: Classically, the particle is confined to the region where E > V(x). 

= [A" + 2iA'<t>' + i A<j>" - 4(0') 2 ]. [8.4] 

dx l 

Putting this into Equation 8.1, 

p 2 

A" + 2iA'4> + iA<j >" — A((f)') 2 = — j^A. [8.5] 

This is equivalent to two real equations, one for the real part and one for the imaginary 
part: 

A" - A{(t>') 2 = -^2 A, or A" = A J^(0') 2 - [8.6] 


and 


2 A'(f> r + A(f}" = 0, or ( A 2 4>') > = 0. [8.7] 

Equations 8.6 and 8.7 are entirely equivalent to the original Schrodinger equa¬ 
tion. The second one is easily solved: 


A 2 4> ! = C 2 , or A 


C 

>/r 


[ 8 - 8 ] 


where C is a (real) constant. The first one (Equation 8.6) cannot be solved in general— 
so here comes the approximation: We assume that the amplitude A varies slowly, so 
that the A" term is negligible. (More precisely, we assume that A"lA is much less 
than both (<//) 2 and p 2 fh 2 .) In that case we can drop the left side of Equation 8.6, 
and we are left with 


m 2 = 


El 

h 2 


or 


d± = p 
dx r 
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and therefore 

<f>(x) = J p(x)dx. [8.9] 

(I’ll write this as an indefinite integral, for now—any constant of integration can be 
absorbed into C, which thereby becomes complex.) It follows, then, that 


IK*) = 


C c ±k f P(x)dx 


[ 8 . 10 ] 


and the general (approximate) solution will be a linear combination of two such terms, 
one with each sign. 

Notice that 

ICI 2 

m*)\ 2 = V-, [8.ii] 

P(x) 

which says that the probability of finding the particle at point x is inversely pro¬ 
portional to its (classical) momentum (and hence its velocity) at that point. This is 
exactly what you would expect—the particle doesn’t spend long in the places where 
it is moving rapidly, so the probability of getting caught there is small. In fact, the 
WKB approximation is sometimes derived by starting with this “semiclassical” ob¬ 
servation, instead of by dropping the A" term in the differential equation. The latter 
approach is cleaner mathematically, but the former offers a more plausible physical 
rationale. 


Example: Potential well with two vertical walls. Suppose we have an 
infinite square well with a bumpy bottom (Figure 8.2): 


V(x) = 


some specified function, if 0 < x < a, 
oo, otherwise. 


[ 8 . 12 ] 



Figure 8.2: Infinite square well 
with a bumpy bottom. 
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Inside the well [assuming E > V (x) throughout] we have 

fix) = -A= [C + e‘* (x) + C_<T*«], 
yjpix) 

or, more conveniently, 

[Cl Sin0(x) + C 2 cos0(x)], [8.13; 

vTO 

where (exploiting the freedom noted earlier to impose a convenient lower limit on the 
integral) 

<£(*) = p(x')dx'. [8.14; 

Now 0(x) must go to zero at x = 0, so, since 0(0) = 0, C 2 = 0. Also, 0(x) goes 
to zero at x = a, so 


< p(a ) —rm (n = 1, 2, 3 ...). 


[8.15] 


Conclusion: 



[8.16] 


This quantization condition is our main result; it determines the (approximate) allowed 
energies. 

For instance, if the well has a fiat bottom [V (x) = 0], then p(x ) = \JlmE (a 
constant), and Equation 8.16 says pa — nnh, or 

nVh 2 
E" 2m a 2 

which are precisely the energy levels of the original infinite square well (Equa¬ 
tion 2.23). In this case the WKB approximation yields the exact answer (the am¬ 
plitude of the true wave function is constant , so dropping A" cost us nothing). 


❖Problem 8.1 Use the WKB approximation to find the allowed energies (E„ ) ot 
an infinite square well with a “shelf”, of height V 0 , extending half-way across (see 
Figure 6.3): 
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Vo, if 0 < x < a/2, 

V(x) = < 0, if a/2 < x < a, 

oo, otherwise. 

Express your answer in terms of Vo and E® = (nnh) 2 flma 2 (the nth allowed energy 
for the “unperturbed” infinite square well, with no shelf). Assume that E ® > Vo, but 
do not assume that E n » V 0 . Compare your result with what we got in Section 6.1.2, 
using first-order perturbation theory. Note that they are in agreement if either Vo is 
very small (the perturbation theory regime) or n is very large (the semiclassical WKB 
regime). 

** Problem 8.2 An illuminating alternative derivation of the WKB formula (Equation 
8.10) is based on an expansion in powers of h. Motivated by the free particle wave 
function, ifs — A exp (ztipx/h), we write 

ir(x) = e if{x)/ \ 

where /(x) is some complex function. (Note that there is no loss of generality here— 
any nonzero function can be written in this way.) 

(a) Put this into Schrodinger’s equation (in the form of Equation 8.1), and show 
that 

/*/" - (ff + P 2 = o. 

(b) Write /(x) as a power series in h: 

fix) - fo(x) +tifi(x) +h 2 f 2 (x) H-, 

and, collecting like powers of h, show that 

(/o) 2 = p 2 . i/o'= 2/o7,', i/i" = 2 / 0/2 + (f[) 2 . etc. 

(c) Solve for f 0 (x) and /, (x), and show that—to first order in h —you recover 
Equation 8.10. 

Note : The logarithm of a negative number is defined by ln(—z) = ln(z) + inn , where 
n is an odd integer. If this formula is new to you, try exponentiating both sides, and 
you’ll see where it comes from. 
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8.2 TUNNELING _ 


So far, I have assumed that E > V, so that p(x) is real. But we can easily write down 
the corresponding result in the ncwclassical region (E < V )—it’s the same as before 
(Equation 8.10), only now p(x) is imaginary 2 -. 


fix) = 


__£_ p ±l f \p{x)\dx 

VLp(UI 


[8.17] 


Consider, for example, the problem of scattering from a rectangular barrier with 
a bumpy top (Figure 8.3). To the left of the barrier (x < 0), 


fix) = Ae ikx + Be~ ikx 


[8.18] 


where A is the incident amplitude, B is the reflected amplitude, and k = y/2mE/h 
(see Section 2.7). To the right of the barrier (x > a), 


f{x) = Fe ikx ; 

F is the transmitted amplitude, and the tunneling probability is 

T= \F? 

Ml 2 ' 

In the tunneling region (0 < x < a), the WKB approximation gives 


[8.19] 


[ 8 . 20 ] 



2 In this case the wave function is real, and the analogs to Equations 8.6 and 8.7 do not follow 
necessarily from Equation 8.5, although they are still sufficient. If this bothers you, study the alternate e 
derivation in Problem 8.2. 
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Figure 8.4: Qualitative structure of the wave function, for scattering from a high, 
broad barrier. 


ir(x) = 


C 



| /; \p(x')\dx' 


+ 





[ 8 . 21 ] 


But if the barrier is very high and/or very wide (which is to say, if the probability of 
tunneling is small), then the coefficient of the exponentially increasing term (C) must 
be small (in fact, it would be zero if the barrier were infinitely broad), and the wave 
function looks something like 3 Figure 8.4. The relative amplitudes of the incident and 
transmitted waves are determined essentially by the total decrease of the exponential 
over the nonclassical region: 


so that 


\I1 ~ e -*f a a \P^\ dx ' 

Ml 


T = e 2y , with 


-if 


\p(x)\dx. 


[ 8 . 22 ] 


Example: Gamow’s theory of alpha decay. In 1928, George Gamow (and, 
independently, Condon and Gurney) used this result to provide the first theoretical 
account of alpha decay (the spontaneous emission of an alpha particle—two protons 
and two neutrons—by certain radioactive nuclei). Since the alpha particle carries a 
positive charge ( 2e ), it will be electrically repelled by the leftover nucleus (charge Z e) 
as soon as it gets far enough away to escape the nuclear binding force. Gamow pictured 
the potential energy curve for the alpha particle as a finite square well (representing 
the attractive nuclear force), extending out to r\ (the radius of the nucleus), joined 
to a repulsive Coulombic tail (Figure 8.5). If E is the energy of the emitted alpha 
particle, the outer turning point (r 2 ) is determined by 


*This heuristic argument can be made more rigorous—see Problem 8,10. 
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Figure 8.5: Gamow’s model for the potential energy of an alpha particle in a 
radioactive nucleus. 


4 7ze 0 r 2 

The exponent y (Equation 8.22) is evidently 4 


[8.23] 


V 



2 Ze 2 


-E^Jdr = 




r 


= [^2 cos 1 y/r\/r 2 - y/ri(r 2 - n)] ■ [8-24] 

Typically, rj «r 2 , and we can simplify this result. The argument of the inverse 
cosine is close to zero, so the angle itself is close to jt/2. Call it 9 = {n/2) - e : 
then 


cos 0 = cos(^/2) cos e + sin(?r/2) sin e = sin e = e, 


and hence 


4 In this case the potential does not drop to zero on both sides of the barrier (moreover, this is reall> 
a three-dimensional problem), but the essential inspiration, contained in Equation 8.22, is all we realh 
need. 
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Thus 


\J2.fn E r 7i _ 1 /. j - 

Y = —^— [ 2 r2 “ 2 VVj = K \ - K 2 , [8.25] 

where 

Ki = (-^—) 71 = 1.980MeV 1/2 , [8.26] 

\4^6 0 y h 

and 

/ e 2 \ lf2 A«Jm xn 

K 2 = {- - - 7 — = 1-485 fm~ 1/2 . [8.27] 

\ 4^ e 0 / h 

(One fermi, fm, is 10“ 15 m, which is about the size of a typical nucleus.) 

If we imagine the alpha particle rattling around inside the nucleus, with an 
average velocity v, the average time between “collisions” with the “wall” is about 
2 rifv, and hence the frequency of collisions is vjlr\. The probability of escape at 
each collision is e~ 2y , so the probability of emission, per unit time, is (v/2rj)e~ 2y , 
and hence the lifetime of the parent nucleus is about 



v 


[8.28] 


Unfortunately, we don’t know v —but it hardly matters, for the exponential factor 
varies over a. fantastic range (25 orders of magnitude) as we go from one radioac¬ 
tive nucleus to another; relative to this the variation in v is pretty insignificant. In 
particular, if you plot the logarithm of the experimentally measured lifetime against 
l/VE (related to y by Equation 8.25), the result is a beautiful straight line (Figure 
8 . 6 ), confirming that the lifetime of an alpha emitter is governed by the difficulty of 
penetrating the Coulomb barrier. 


❖Problem 8.3 Use Equation 8.22 to calculate the approximate transmission proba¬ 
bility for a particle of energy E that encounters a finite square barrier of height Vo > E 
and width 2a. Compare the exact result (Prob. 2.32) in the WKB regime f«l. 

❖❖Problem 8.4 Calculate the lifetimes of U 238 and Po 212 , using Equation 8.28, with 
Equation 8.25 for y. Hint : The density of nuclear matter is relatively constant (i.e., 
the same for all nuclei), so (ri ) 3 is proportional to A (the number of neutrons plus 
protons). Empirically, 


r x = (1.07 fm)A 1/3 . [8.29] 

The energy of the emitted alpha particle is determined by Einstein’s formula 
(E = me 2 ): 
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E [MeV] 


Figure 8.6: Graph of the logarithm of the lifetime versus l / Qe, for several alpha 
emitters. From David Park, Introduction to the Quantum Theory, 3rd ed. (New 
York: McGraw-Hill, 1992). (See acknowledgment in Preface.) 


E — m p c 2 — mdC 2 — m a c 2 , [8.30] 

where m p is the mass of the parent nucleus, m d is the mass of the daughter nucleus, 
and m a is the mass of the alpha particle (which is to say, the He 4 nucleus). To figure 
out what the daughter nucleus is, note that the alpha particle carries off two proton" 
and two neutrons, so Z decreases by 2 and A by 4. Look up the relevant nuclear 
masses. To estimate v, use E = (1/2 )m a v 2 \ this ignores the (negative) potential 
energy inside the nucleus, and surely underestimates v, but it’s about the best we can 
do at this stage. Incidentally, the experimental lifetimes are 6 x 10 9 years and 0.5 /rs. 
respectively. 


8.3 THE CONNECTION FORMULAS _ 

In the discussion so far I have assumed that the “walls” of the potential well (or 
the barrier) are vertical, so that the “exterior” solution is simple and the boundan 
conditions trivial. As it turns out, our main results (Equations 8.16 and 8.22) are 
reasonably accurate even when the edges are not so abrupt (indeed, in Gamow’s theory 
they were applied to just such a case). Nevertheless, it is of some interest to stud> 
more closely what happens to the wave function at a turning point (E = V), where 
the “classical” region joins the “nonclassical” region and the WKB approximation 
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itself breaks down. In this section I’ll treat the bound-state problem (Figure 8.1); you 
get to do the scattering problem for yourself (Problem 8.10). 5 

For simplicity, let’s shift the axes over so that the right-hand turning point occurs 
at x = 0 (Figure 8.7). In the WKB approximation, we have 


tKx) = 


_L_ [ Be * f* p(x ' )dx ’ + Ce~ * /. p(x ' )dx ' 1 

/^>L J’ 

_J_ De~* fv lp(x ' )ldx ' 


J\P(X)\ 


if x < 0, 


if x > 0. 


[8.31] 


[Assuming that V(x) remains greater than E for all x > 0, we can exclude the 
positive exponent in this region, because it blows up as x —► oo.] Our task is to 
join the two solutions at the boundary. But there is a serious difficulty here: In the 
WKB approximation, \}r goes to infinity at the turning point, where p(x) -» 0. The 
true wave function, of course, has no such wild behavior—as anticipated, the WKB 
method simply fails in the vicinity of a turning point. And yet, it is precisely the 
boundary conditions at the turning points that determine the allowed energies. What 
we need to do, then, is splice the two WKB solutions together, using a “patching” 
wave function that straddles the turning point. 

Since we only need the patching wave function ($ p ) in the neighborhood of the 
origin, we’ll approximate the potential by a straight line : 


V(x) = E + V'(0)x, 


[8.32] 


and solve the Schrodinger for this linearized V: 


Linearized 

potential 



Figure 8.7: Enlarged view of the right-hand turning point. 


1 Warning: The following argument is quite technical, and you may wish to skip it on a first reading. 
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ft 2 d 2 jr p 
2m dx 2 


+ {E + V'XO)x)^ p = Eir p . 


d 2 ir p 3 

where 

The a’s can be absorbed into the independent variable by defining 


so that 


d 2 ty P 

dz 2 


= zir p . 


[8.33] 


[8.34] 


[8.35] 


[8.36] 


This is Airy’s equation, and the solutions are called Airy functions . 6 Since the Airy 
equation is a second-order differential equation, there are two linearly independent 

Table 8.1: Some properties of the Airy functions. 


Differential Equation: 


d 2 y 

lT 2 = zy - 

Linear combinations of Airy Functions, Ai(z ) and Bi(z). 


1 /V \ 

Integral Representation: Ai(z) = — / cos | -b sz I ds 

*h V 3 ) 

Bi(z) = —J + sin j a 


Asymptotic Forms: 


Ai(z) - 


Bi(z) - 


Ifftt z 1/4 


1 


■yfn z ] A 


Z» 0 


Ai(z) - 


Bi(z) - 




ffjF(-z ) 1 / 4 

1 


6 Classically , a linear potential means a constant force, and hence a constant acceleration—the 
simplest nontrivial motion possible, and the starting point for elementary mechanics. It is ironic that the 
same potential in quantum mechanics gives rise to unfamiliar transcendental functions, and plays only a 
peripheral role in the theory. 
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Figure 8.8: Graph of the Airy functions. 
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Airy functions, Ai (z) and Bi (z ); the general solution is a linear combination of these. 
Ai and Bi are related to Bessel functions of order 1/3; some of their properties are 
listed in Table 8.1 and they are plotted in Figure 8.8. Evidently the patching wave 
function is 

\fr p {x) = aAi(ax ) + bBi(ax), [8.37] 

for appropriate constants a and b. 

Now \}rp is the (approximate) wave function in the neighborhood of the origin; 
our job is to match it to the WKB solutions in the overlap regions on either side 
(see Figure 8.9). These overlap zones are close enough to the turning point that the 
linearized potential is reasonably accurate (so that \fr p is a good approximation to the 
true wave function), and yet far enough away from the turning point that the WKB 
approximation is reliable. 7 In the overlap regions Equation 8.32 holds, and therefore 
(in the notation of Equation 8.34) 

p(x) = y/2m(E-E-V'( 0)jc) = ha 3/2 ^f^. [8.38] 

In particular, in overlap region 2, 


7 This is a delicate double constraint, and it is possible to concoct potentials so pathological that no 
such overlap region exists. However, in practical applications this seldom occurs. See Problem 8.8. 
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Overlap 1 


-Vp- 


Kti 


WVKB 


ummg 0ver | ap 2 
point 


Patching 

region 

Figure 8.9: Patching region and the two overlap zones. 


f \p(x')\dx' = ha 3 ^ 2 f 

Jo Jo 

and therefore the WKB wave function (Equation 8.31) can be written a 


\p{x')\dx r = ha 3/2 / VJddx f = -h(ax ) 3/2 , 
Jo 3 


f(x) = 


Vho( 3 ^x 1 / 4 




[8.39] 


Meanwhile, using the large-z asymptotic forms 8 of the Airy functions (from Table 
8.1), the patching wave function (Equation 8.37) in overlap region 2 becomes 


f P (x) = ; 


l^/niaxy / 4 
Comparing the two solutions, we see that 
I An 


-\{axf 2 + 


*Jn{ax) X l A 


\{axf 2 


ei 


a = y—D, and b = 0. 
ah 


[8.40] 


[8.41] 


Now we go back and repeat the procedure for overlap region 1. Once again. 
p(x) is given by Equation 8.38, but this time x is negative, so 


r° 2 

j p(x f ) dx’ = -h(-axf !2 

and the WKB wave function (Equation 8.31) is 


[8.42] 


8 At first glance it seems absurd to use a large-z approximation in this region, which after all is 
supposed to be reasonably close to the turning point at z = 0 (so that the linear approximation to the 
potential is valid). But notice that the argument here is ax, and if you study the matter carefully (see 
Problem 8.8) you will find that there is (typically) a region in which ax is large, but at the same time it is 
reasonable to approximate V (x) by a straight line. 
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ir(x) = —-!- \ Be 1 ^- 01 ^ 2 + Ce-'1 ( -“ x)3/2 1. 

V^a 3/4 (—x) 1 / 4 L J 


[8.43] 


Meanwhile, using the asymptotic form of the Airy function for large negative z 
(Table 8.1), the patching function (Equation 8.37, with b = 0) reads 


■[!<-»> 13/2 +? 


*' (x) = ^(-« x yi‘ sSn [l { - ax),l ‘ + '- 

__L jV^V _ e -injA e -i\(-ax)" > l 1 ']^ 


[8.44] 


A /i : (—ax) 1 / 4 2 i I 

Comparing the WKB and patching wave functions in overlap region 1, we find 

a ^ e 171 ^ = —^= and -— e~' l7tl4 = -y=, 

2iy/n y/Tia 2iy/7i ^/fia 


or, putting in Equation 8.41 for a, 

B = -ie i7T/4 D, and C = ie~ i7Z/4 D. 


[8.45] 


These are the so-called connection formulas, joining the WKB solutions at either 
side of the turning point. We’re done with the patching wave function now—its 
only purpose was to bridge the gap. Expressing everything in terms of the one 
normalization constant D, and shifting the turning point back from the origin to an 
arbitrary point X 2 , the WKB wave function (Equation 8.31) becomes 


t/t(x) = 


■sin[±/; ! PU')dx' + 

-if'w’W 


if x < X 2 ; 
if x > X 2 . 


[8.46] 


Example: Potential well with one vertical wall. Imagine a potential well 
that has one vertical side (at x = 0) and one sloping side (Figure 8.10). In this case 
•ft (0) = 0, so Equation 8.46 says 


1 

h 


p(x)dx + — = njt, 


0 n - 1, 2, 3,...), 


or 


I, 


p(x) dx = 



[8.47] 


For instance, consider the “half-harmonic oscillator”, 
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Figure 8.10: Potential well with one vertical wall. 


In this case 


where 


V(x) = J \mo) 2 x 2 , ifx > 0, 

I 0, otherwise. 


p(x) = y/2m[E — (1/2 )mo) 2 x 2 ] = mco^Jx 2 - x 2 , 


[8.48] 


is the turning point. So 


I p(x)dx=mwi Jxl — x 2 dx =—mo)xl 

Jo Jo v 4 

and the quantization condition Equation 8.47 yields 


71E 
2 co ’ 


E„ = (in -~^hw= Q, n ~, y,.. ha). [8.49] 

In this particular case the WKB approximation actually delivers the exact allowed 
energies (which are precisely the odd energies of the full harmonic oscillator—see 
Problem 2.38). 


Example: Potential well with no vertical walls. Equation 8.46 con¬ 
nects the WKB wave functions at a turning point where the potential slopes upward 
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(Figure 8.11a); the same reasoning, applied to a downward -sloping turning point 
(Figure 8.1 lb), yields (Problem 8.9) 





f*' I/>(*')! dx’ 


if x < jci; 


ID' . T1 f x 
. sin - / 

Vp(x) U Jx x 


p(x') dx' 




if x > jcj . 


[8.50] 


In particular, if we’re talking about a potential well (Figure 8.1 lc), the wave function 
in the “interior” region (xi < x < x 2 ) can be written either as 


\fr(x)= _ : sinfl 2 0O* where 0 2 (x) = - [ p{x')dx '— 

y/p(x) h J x 4 

(Equation 8.46), or as 

}fr(x) = sin^(x), where 0i(x) = — \ f p{x')dx' - % 

JW) * J Xl 4 

(Equation 8.50). Evidently the arguments of the sine functions must be equal, 
modulo 7i : 9 0 2 = Q\ + nn, from which it follows that 


f x2 ( 

\ 

I p(x) dx ~ln-- 

| nh, with n = 1, 2, 3,- 


This quantization condition determines the allowed energies for the “typical” 
case of a potential well with two sloping sides. Notice that it differs from the formulas 


9 Not 2n —an overall minus sign can be absorbed into the normalization factors D and D'. 
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for two vertical walls (Equation 8.16) or one vertical wall (Equation 8.47) only in 
the number that is subtracted from n (0, 1/4, or 1/2). Since the WKB approximation 
works best in the semiclassical (large n ) regime, the distinction is more in appearance 
than in substance. In any event, the result is extraordinarily powerful, for it enables 
us to calculate (approximate) allowed energies without ever solving the Schrodinger 
equation , by simply evaluating one integral. The wave function itself has dropped 
out of sight. 


** Problem 8.5 Consider the quantum mechanical analog to the classical problem of 

a ball (mass m ) bouncing elastically on the floor. 

(a) What is the potential energy, as a function of height x above the floor? (For 
negative x, the potential is infinite —the ball can’t get there at all.) 

(b) Solve the Schrodinger equation for this potential, expressing your answer in 
terms of the appropriate Airy function [note that Bi(z ) blows up for large z, and 
hence does not yield a normalizable wave function]. Don’t bother to normalize 
*(*). 

(c) Using g = 9.80 m/s 2 and m — 0.100 kg, find the first four allowed energies, 
in Joules, correct to three significant digits. Hint : see Milton Abromowitz 
and Irene A. Stegun, Handbook of Mathematical Functions (New York: Dover 
1970), page 478; the notation is defined on page 450. 

(d) What is the ground state energy, in eV, of an electron in this gravitational field? 
How high off the ground is this electron, on the average? Hint : Use the virial 
theorem to determine (jc). 


❖Problem 8.6 Analyze the bouncing ball (Problem 8.5) using the WKB approxima¬ 
tion. 

(a) Find the allowed energies E n in terms of m , g, and h. 

(b) Now put in the particular values given in Problem 8.5(c), and compare the WKB 
approximation to the first four energies with the “exact” results. 

(c) About how large would the quantum number n have to be to give the ball an 
average height of, say, 1 meter above the ground? 


❖Problem 8.7 Use the WKB approximation to find the allowed energies of the 
harmonic oscillator. 


Problem 8.8 Consider a particle of mass m in the nth stationary state of the har¬ 
monic oscillator (angular frequency m). 


(a) Find the turning point X 2 . 
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(b) How far id) could you go above the turning point before the error in the linearized 
potential (Equation 8.32, but with the turning point at x 2 ) reaches 1%? That is, 
if 

V(x 2 +d)-V lin (x 2 + d) _ Qoi 
V(x 2 ) ' ’ 

what is d? 

(c) The asymptotic form of Ai(z ) is accurate to 1% as long as z > 5. For the d in 
part (b), determine the smallest n such that ad >5. (For any n larger than this, 
there exists an overlap region in which the linearized potential is good to 1% 
and the large-z form of the Airy function is good to 1%.) 

**Problem 8.9 Derive the connection formulas at a downward-sloping turning point, 
and confirm Equation 8.50. 

***Problem 8.10 Use appropriate connection formulas to analyse the problem of 
scattering from a barrier with sloping walls (Figure 8.12). Begin by writing the WKB 
wave function in the form 


M*) = 


: < xO; 


L J 

. \ Ce i /,; ^ + De -i x; , ( „ < x < X2); t 8 - 52 ' 
vW |_ J 






(x > x 2 ). 


Do not assume C ~0. Calculate the tunneling probability, T — \ F\ 2 /\A | 2 , and show 
that your result reduces to Equation 8.22 in the case of a broad, high barrier. 



Figure 8.12: Barrier with sloping walls. 
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FURTHER PROBLEMS FOR CHAPTER 8 


**Problem 8.11 Use the WKB approximation to find the allowed energies of the 
general power-law potential: 


V(x) =a\x\\ 


where v is a positive number. Check your result for the case v = 2. Answer: 


E n = a 


(n - 1/2)72 


/ZTEii+i) 

V 2 ma r (i + 1) 



[8.53] 


**Problem 8.12 Use the WKB approximation to find the bound-state energy for the 
potential in Problem 2.48. Compare the exact answer. Answer: 

-[(9/8) - (1/V2)])iV/m. 


Problem 8.13 For spherically symmetrical potentials, we can apply the WKB ap- 
proximation to theradial equation, (Equation 4.37). In thecas e/ = 0, it is reasonable 10 
to use Equation 8.47 in the form 

pro 

/ p(r)dr = (n- 1/4)j zTi, [8.54] 

Jo 

where r 0 is the turning point (in effect, we treat r = 0 as an infinite wall). Apply this 
formula to estimate the allowed energies of a particle in the logarithmic potential 

V(r) = U 0 ln (r/a) 

(for constants Vq and a). Treat only the case / = 0. Show that the spacing between 
the levels is independent of mass. Partial answer: 

= V 0 In ('—A). 
_\« - 1/4/ 

**Problem 8.14 Use the WKB approximation in the form 

f p(r)dr = (n-l/2)nh [8.55] 

Jr\ 


Application of the WKB approximation to the radial equation raises some delicate and subtle 
problems, which I will not go into here. The classic paper on the subject is R. Langer Phys Rev 51 669 
(1937). 
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Figure 8.13: Symmetric double well; Problem 8.15. 


to estimate the bound state-energies for hydrogen. Don’t forget the centrifugal term 
in the effective potential Equation 4.38. The following integral may help; 


f 


a/(* ~a)(b - x) = -{sfb - *Ja) 2 . 


Note that you recover the Bohr levels when n » / and n '» 1/2. Answer: 

-13.6 eV 


Em = 


[n - (1/2) + vTlT+T)] 2 ' 


[8.56] 


[8.57] 


***Problem 8.15 Consider the case of a symmetrical double-well, such as the one 
pictured in Figure 8.13. We are interested in bound states with E < V (0). 

(a) Write down the WKB wave functions in regions (i) x > x 2 , (ii) *i < x < x 2 , 
and (iii) 0 < v < x\. Impose the appropriate connection formulas at x\ andx 2 
(this has already been done, in Equation 8.46, for x 2 ; you will have to work out 
x\ for yourself), to show that 



e 


£ \p(x')\dx' 


nx) = 



p{x') dx r 


+ 


3 


(i) 


(ii) 



|2cos^r ^ dx ' +sin^r lp(x ' )ldx ' 


(iii) 


where 
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(b) Because V (x) is symmetric, we need only consider even (+) and odd (-) wave 
functions. In the former case 0'(O) = 0, and in the latter case 0(0) = 0. Show 
that this leads to the following quantization condition: 


tan# = ±2e (p , 


where 


0 = 


■u: 


\p(x')\dx’. 


[8.59] 


[8.60] 


Equation 8.59 determines the (approximate) allowed energies (note that E comes 
into X] and X 2 , so 0 and (ft are both functions of E). 

(c) We are particularly interested in a high and/or broad central barrier, in which 
case (ft is large and e? is huge. Equation 8.59 then tells us that 9 must be very 
close to a half-integer multiple of 7T. With this in mind, write 0 = (w+l/2)jr+e, 
where \e\ <<C 1, and show that the quantization condition becomes 


0s(„ + I) 

(d) Suppose each well is a parabola 11 : 


71 T 


V(x)-. 


^mco 2 (x + a) 2 , if jc < 0, 
\ma) 2 (x - a) 2 , if x > 0. 

Sketch this potential, find 9 (Equation 8.58), and show that 


£ -*H) 


* bco 
no) =}= —e 

2 Jt 


[8.61] 


[8.62] 


[8.63] 


Note: If the central barrier were impenetrable ((ft ->• oo), we would simply have 
two detached harmonic oscillators, and the energies E n ~ (n +1 /2)ft a) would be 
doubly degenerate, since the particle could be in the left well or in the right one. 
When the barrier becomes finite, putting the two wells into “communication”, 
the degeneracy is lifted. The even states (0^") have slightly lower energy, and 
the odd ones (f~) have slightly higher energy. 


11 Even if V (x ) is not strictly parabolic in each well, this calculation of 6, and hence the result (Equa¬ 
tion 8.63), will be approximately correct, in the sense discussed in Section 2.3, with <o = y/v ,f (xo)fm. 
where xq is the position of the minimum. 
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(e) Suppose the particle starts out in the right well—or, more precisely, in a state 
of the form 


which, assuming the phases are picked in the “natural” way, will be concentrated 
in the right well. Show that it oscillates back and forth between the wells, with 
a period 



to 


(f) Calculate </>, for the specific potential in part (d), and show that for V (0) E, 
<p ~ mtoa 2 jh. 



CHAPTER 9 


TIME-DEPENDENT 
PERTURBATION THEORY 


Up to this point, practically everything we have done belongs to the subject that 
might properly be called quantum statics, in which the potential energy function is 
independent of time: V (r, t) = V (r). In that case the (time-dependent) Schrodinger 
equation, 

HV = ih —, 
dt 

can be solved by separation of variables: 

'I'Cr, t) = ir(r)e~ iEt/Tl , 

where VUr) satisfies the time-independent Schrodinger equation, 

Hf = Eyfr. 

Because the time dependence of is carried by the exponential factor (e~ l Et!h ), which 
cancels out when we construct the physically relevant quantity |vp| 2 , all probabilities 
and expectation values are constant in time. By forming linear combinations of these 
stationary states we obtain wave functions with more interesting time dependence, 
but even then the possible values of the energy, and their respective probabilities, are 
constant. 

If we want to allow for transitions between one energy level and another, we 
must introduce a time-dependent potential (quantum dynamics). There are precious 
few exactly solvable problems in quantum dynamics. However, if the time-dependent 
portion of the Hamiltonian is small compared to the time-independent part, it can be 
treated as a perturbation. My purpose in this chapter is to develop time-dependent 


2QR 
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perturbation theory, and study its most important application: the emission or absorp¬ 
tion of radiation by an atom—a process known in the old Bohr theory as a quantum 
jump. 


9.1 TWO-LEVEL SYSTEMS _ 

To begin with, let us suppose that there are just two states of the (unperturbed) system, 
\jr a and f b . They are eigenstates of the unperturbed Hamiltonian Hy. 

Hoi/a = E a fa and Hofb = E h f b , [9.1] 

and they are orthonormal: 


(fa\itb) = &ab- [ 9 - 2 ] 

Any state can be expressed as a linear combination of them; in particular, 

'I'(O) = c a fa + Cbtyb- [9.3] 

The states \jr a and i/a might be position-space wave functions, or spinors, or 

something more exotic—it doesn’t matter. It is the time dependence that concerns 
us here, so when I write 'l'(f), I simply mean the state of the system at time t. 
In the absence of any perturbation, each component evolves with its characteristic 
exponential factor: 

¥(0 = c a f a e- ,E -’l h + c b * b e- ,E ‘ , > i . [9.4] 

We say that \c a 2 is the “probability that the particle is in state \ff a "—by which we 
really mean the probability that a measurement of the energy would yield the value 
E a . Normalization of ^ requires, of course, that 

| c * 1 2 + | q ,| 2 = 1. [9.5] 


9.1.1 The Perturbed System 

Now suppose we turn on a time-dependent perturbation H'(t). Since \j/ a and fj, 
constitute a complete set, the wave function W(r) can still be expressed as a linear 
combination of them. The only difference is that c a and c h are now functions oft : 

'l'(t) = c a (t)f a e~ iEat/ri + Cb(t)f b e~ iEht/h . [9.6] 

[I could absorb the exponential factors into c a (0 and c b (t), and some people prefer to 
do it this way, but I think it is nicer to keep visible that part of the time dependence that 
would be present even without the perturbation.] The whole problem is to determine 
c a and Cb as functions of time. If, for example, the particle started out in the state f a . 
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so that Ca(0) = 1 and q,(0) = 0, and at some later time t\ we find that c a (t]) = 0. 
Cb{t\) = 1, we shall report that the system underwent a transition from \j/ a to fa. 

We solve for c a (0 and q, (t) by demanding that ^ (t) satisfy the time- dependent 
Schrodinger equation, 

HV = ih — , where H=H 0 + H'(t). [9.7] 

dt 

From Equations 9.6 and 9.7, we find 

CclHoMe- ,E ‘‘/* + c b lHot b ]e~ iE ^ + c a [H'f a )e- iE - ,lh 

+ c b [H'f b \e-' E> ' lh = + c b f 

+ e~‘ E -» + c b f b (-tf) e~‘ E fa 

In view of Equation 9.1, the first two terms on the left cancel the last two terms on 
the right, and hence 

CaW'ta]e-‘ E -' / ’' + c b [H'f b \e- ,E ‘'/ h = it, + i b f b e -' E -^]. [9.8] 

To isolate c a , we use the standard trick: Take the inner product with i fr a , and 
exploit the orthogonality of fa and fa (Equation 9.2): 

c a {ir a \H'W a )e- iE -' lh + c b {t a \H'\t b )e- ,El -'l* = it,c a e-‘ E -' / ’'. 

For short, we define 

= [9.9] 

note that the Hermiticity of H' entails = (HC)*. Multiplying through by 
~(i jTi)e lEat/h , we conclude that 

ia = \caKa + . [9.10] 

Similarly, the inner product with fa picks out q, : 

c„W b \H'\f<,)e- ,E - l ' , ‘ +c b (t b \H'\f b )e- ,Et,IE = iU b e~ iE *'l", 

and hence 

6 b = --{c b H' bb + c a H' ba e“- E >- E -' > " t ] . [9.11] 

Equations 9.10 and 9.11 determine c a (t) and Cb(t); taken together, they are 
completely equivalent to the (time- dependent) Schrodinger equation, for a two-level 
system. Typically, the diagonal matrix elements of H' vanish (see Problem 9.4 for 
the more general case in which the diagonal terms are not zero): 

Ka = HU=0. 


[9.12] 



Sec. 9.1: Two-Level Systems 301 


In that case the equations simplify: 


where 


ca = -\H' ab e-^c b , 
Cb = 


COo = 


E b -E a 

h 


(We’ll assume that E b > E a , so co G > 0.) 


[9.13] 


[9.14] 


♦Problem 9.1 A hydrogen atom is placed in a (time-dependent) electric field E = 
E(t)k. Calculate all four matrix elements H[- of the perturbation H' = —eEz 
between the ground state (n = 1) and the (quadruply degenerate) first excited states 
(n = 2). Also show that H' u — 0 for all five states. Note : There is only one integral 
to be done here, if you exploit oddness with respect to z. As a result, only one of the 
n = 2 states is “accessible” from the ground state by a perturbation of this form, and 
therefore the system functions as a two-level configuration— assuming transitions to 
higher excited states can be ignored. 

♦Problem 9.2 Solve Equation 9.13 for the case of a time-independent perturbation, 
assuming that c a (0) = 1 and q,( 0) = 0. Check that \c a (t)\ 2 + |o,(r)| 2 = 1. Note: 
Ostensibly, this system oscillates between “pure and “some \jf b \ Doesn’t this 
contradict my general assertion that no transitions occur for time-independent pertur¬ 
bations? No, but the reason is rather subtle: In this case i fr a and i fr b are not, and never 
were, eigenstates of the Hamiltonian—a measurement of the energy never yields E a 
or E b . In time-dependent perturbation theory we typically contemplate turning on the 
perturbation for a while, and then turning it off again, in order to examine the system. 
At the beginning, and at the end, i js a and i fr b are eigenstates of the exact Hamilto¬ 
nian, and only in this context does it make sense to say that the system underwent 
a transition from one to the other. For the present problem, then, assume that the 
perturbation was turned on at time t = 0, and off again at time t —this doesn’t affect 
the calculations, but it allows for a more sensible interpretation of the result. 

Problem 9.3 Suppose the perturbation takes the form of a delta function (in time): 


H' = US(t - t 0 ); 


assume that U aa = U bb = 0, and let U ab = a. If c a (— oo) = 1 and c b (— oo) = 0, 
find c a (t ) and c b {t), and check that \c a (t)\ 2 + |q,( 0| 2 = 1. What is the probability 
(P a ~+ b ) that a transition occurs? Hint. Refer to Problem 2.24. Answer : P a -> b = 
(|a| 2 /» 2 )/(l + l«l 2 /4S 2 ) 2 . 
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9.1.2 Time-Dependent Perturbation Theory 

So far, everything is exact : We have made no assumption about the size of the pertur¬ 
bation. But if H' is “small”, we can solve Equations 9.13 by a process of successive 
approximations, as follows. Suppose the particle starts out in the lower state: 

Q(0) = 1, q,(0) =0. [9.15] 

If there were no perturbation at all, they would stay this way forever: 

Zeroth Order: 

cf\t) = 1, 

To calculate the first-order approximation 
Equation 9.13: 

First Order: 

dc a n\ 

■^ = ° =» ^« = 1; 

^ => 4‘ ) = -- jf dt' . [9.17] 

Now we insert these expressions on the right to obtain the second -order approxima¬ 
tion: 


cf(r) = 0. [9.16] 

, we insert these values on the right side of 


Second Order: 


dc a 

~dt 


cf\t) = 1 


rH'e 




4 f H^Oe-^ f H' ba (t' r )e ico ° l " dt 1 
* Jo Jo 


dt 1 , 


[9.18] 


while cb is unchanged, cf\t) = c b \t). [Notice that in my notation c^Ht) includes 
the zeroth order term; the second-order correction would be the integral term alone.] 
In principle, we could continue this ritual indefinitely, always inserting the 
n lh -order approximation into the right side of Equation 9.13 and solving for the 
(n + 1)* order. Notice that c a is modified in every even order, and q, in every 
odd order. Incidentally, the error in the first-order approximation is evident in the fact 
that |c£ 1} (0l 2 + |4 1} (0| 2 # 1 (the exact coefficients must, of course, obey Equation 
9.5). However, \c^(t)\ 2 + k^V)! 2 is equal to 1 to first order in H', which is all 
we can expect from a first-order approximation. And the same goes for the higher 
orders. 
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❖❖ Problem 9.4 Suppose you don’t assume that H^ a = H bb = 0. 

(a) Find c a (t) and c b (t) in first-order perturbation theory, for the case c a (0) — 
1, q,( 0) = 0. Show that |c^(t)| 2 + |c^(0l 2 = 1* to first order in H'. 

(b) There is a nicer way to handle this problem. Let 

1 j f‘ H'it'W , i f H'(t')dt' rrv im 

d a = e* Jo c a , d b = e h Jv ' c b . [9.19] 

Show that 

do = d b = -i<r" H^e^d.. [9.20] 

n n 

where ^ 

<P (0 = \j - H' bb (t')]dt'. [9.21] 

So the equations for d a and d b are identical in structure to Equation 9.13 (with 
an extra factor e l<i> tacked onto H'). 

(c) Use the method in part (b) to obtain c a {t) and c b (t) in first-order perturbation 
theory, and compare your answer to (a). Comment on any discrepancies. 

❖Problem 9.5 Solve Equation 9.13 to second order in perturbation theory, for the 
general case c a (0) = a, c b (Q) — b. 

❖❖Problem 9.6 Calculate c a (t) and c b (t), to second order, for the perturbation in 
Problem 9.2. Compare your answer with the exact result. 


9.1.3 Sinusoidal Perturbations 

Suppose the perturbation has sinusoidal time dependence: 

H'(r, t) = V(r)cos(cot), [9.22] 

so that 

H' ab = VabCos(cot), [9.23] 

where 

V ab ^(faWM. [9.24] 

(As before, I’ll assume that the diagonal matrix elements vanish, since this is almost 
always the case in practice.) To first order (from now on we’ll work exclusively in 
first order) we have (Equation 9.17) 

c b (t) = - l -V ba f cos icot')e lmt ' dt' = ( |>®o+^)r' + dt f 

ft Jo 2h Jq I J 
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y ba r e I '( fU o+") / _ 1 gi(<wo-«)f _ 1 

2h |_ coq + a> o>o — 


[9.25] 


This is the answer, but it’s a little cumbersome to work with. Things simplify 
substantially if we restrict our attention to driving frequencies (to) that are very close to 
the transition frequency (a>o), so that the second term in the square brackets dominates; 
specifically, we assume 

co 0 T co^> \co 0 - co\. [9.26] 


This is not much of a limitation, since perturbations at other frequencies have a 
negligible probability of causing a transition anyway} Dropping the first term, we 
have 

V, 

Cb (t) = 

2h oj() — (o L 


. Vba_ sin[Qo - (x>)tj2) ^_ u>)tn 
Ti oj() — w 


[9.27] 


The transition probability—the probability that a particle which started out in the 
state \]/ a will be found, at time t, in the state i///,—is 


p^bit) = \cb(t) I 2 s 


|Faa | 2 sin 2 [(a>o — a>)r/2] 

h 2 (coq ~ to ) 2 


[9.28] 


The most remarkable feature of this result is that, as a function of time, the 
transition probability oscillates sinusoidally (Figure 9.1). After rising to a maximum 
°f \ V a b\ 2 /h 2 {coQ — co ) 2 —necessarily much less than 1, else the assumption that the 
perturbation is “small” would be invalid— it drops back down to zero! At times 
t„ = 2 n 7 i/\tn 0 — co |, where n = 1, 2, 3,..., the particle is certain to be back in the 
lower state. If you want to maximize your chances of provoking a transition, you 
should not keep the perturbation on for a long period: You do better to turn it off after 
a time ji/\coq-co\, and hope to “catch” the system in the upper state. In Problem 9.7 
it is shown that this “flopping” is not an artifact of perturbation theory—it also occurs 
in the exact solution, though the flopping frequency is modified somewhat. 

As I noted earlier, the probability of a transition is greatest when the driving 
frequency is close to the “natural” frequency coq. This is illustrated in Figure 9.2, 
where P a -+b is plotted as a function of co. The peak has a height of (| V a f\t j2h) 2 and 
a width Aitjt\ evidently it gets higher and narrower as time goes on. (Ostensibly, 
the maximum increases without limit. However, the perturbation assumption breaks 
down before it gets close to 1, so we can believe the result only for relatively small t. 
In Problem 9.7 you will see that the exact result never exceeds 1.) 


Tn the following sections we will be applying this theory to the case of light, for which co ~ 10 14 
Hz, so the denominator in both terms is huge, except (for the second one) in the neighborhood of coo. 
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P(t). 


Wab\ f 
h( O3 0 -O3) 


Figure 9.1: Transition probability as a function of time, for a sinusoidal pertur¬ 
bation (Equation 9.28). 


A A AT 

2n 4 n 6 n 

|C 0 q — co| |( 0 0 — co| |( 0 0 - 0 )| 



Figure 9.2: Transition probability as a function of driving frequency (Equation 9.28). 


♦♦Problem 9.7 The first term in Equation 9.25 comes from the e ioyt j 2 part of cos(tuf), 
and the second from e~ im /2. Thus dropping the first term is formally equivalent to 
writing H' = (V/2)e~ lwt , which is to say, 

U‘ ba = V f [9-29] 

[The latter is required to make the Hamiltonian matrix Hermitian—or, if you prefer, 
to pick out the dominant term in the formula analogous to Equation 9.25 for c a (t).] 
If you make this so-called rotating wave approximation at the beginning of the 
calculation, Equation 9.13 can be solved exactly, with no need for perturbation theory 
and no assumption about the strength of the field. 

(a) Solve Equation 9.13 in the rotating wave approximation (Equation 9.29) for the 
usual initial conditions: c a (0) = 1, q,( 0) — 0. Express your results [ c a (t ) and 
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0,(0] in terms of the Rabi flopping frequency, 

u>r = ^(<W - a*) 2 + (1^1.1 A> 2 - [9.30] 

(b) Determine the transition probability, P a ^b(t), and show that it never exceeds 1 . 
Confirm that |c a (0! 2 + lo,(OI 2 = 1- 

(c) Check that P a -+b(t) reduces to the perturbation theory result (Equation 9.28) 
when the perturbation is “small”, and state precisely what small means in this 
context, as a constraint on V. 

(d) At what time does the system first return to its initial state? 


9.2 EMISSION AND ABSORPTION OF RADIATION _ 

9.2.1 Electromagnetic Waves 

An electromagnetic wave (I’ll refer to it as “light,” though it could be infrared, ul¬ 
traviolet, microwave. X-ray, etc.; these differ only in their frequencies) consists of 
transverse (and mutually perpendicular) oscillating electric and magnetic fields (Fig¬ 
ure 9.3). An atom, in the presence of a passing light wave, responds primarily to the 
electric component. If the wavelength is long (compared to the size of the atom), we 
can ignore the spatial variation in the field 2 ; the atom, then, is exposed to a sinusoidally 
oscillating electric field 

E = E Q cos(cot) k [9-31] 

(for the moment I’ll assume that the light is monochromatic and polarized along the 
z-direction). The perturbing Hamiltonian is 3 

H f = — q Eqz cos(cot), [9.32] 

where q is the charge of the electron. 4 Evidently 5 

H' ba = -pE 0 cos(cot), where p = q{f b \z\fa)- [9-33] 


2 For visible light A ~ 5000 A, while the diameter of an atom is around 1 A, so this approximation 
is reasonable; but it would not be for X-rays. Problem 9.20 explores the effect of'spatial variation in the 
field. 

3 The energy of a charge q in a static field E is —q f E ■ dr. You may well object to the use of 
an electroifafir formula for a manifestly time-dependent field. I am implicitly assuming that the period of 
oscillation is long compared to the time it takes the charge to move around (within an atom). 

4 As usual, we assume that the nucleus is heavy and stationary; it is the wave function of the electron 
we are concerned with. 

5 The letter p is supposed to remind you of electric dipole moment (for which, in electrodynamics, 
the letter p is customarily used—in this context it is rendered as a squiggly p to avoid confusion with 
momentum). In fact, p is the off-diagonal matrix element of the z-component of the dipole moment 
operator qr. 
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Figure 9.3: An electromagnetic wave. 


Typically, f is an even or odd function of z\ in either case z\f\ 2 is odd, and integrates 
to zero (see Problem 9.1 for some examples). This licenses our usual assumption that 
the diagonal matrix elements of H' vanish. Thus the interaction of light with matter 
is governed by precisely the kind of oscillatory perturbation we studied in Section 
9.1.3, with 

V ha = -pE 0 . [9.34] 

9.2.2 Absorption, Stimulated Emission, and Spontaneous Emission 

If an atom starts out in the “lower” state tfr a , and you shine a polarized monochromatic 
beam of light on it, the probability of a transition to the “upper” state is given by 
Equation 9.28, which (in view of Equation 9.34) takes the form 

JU.@) = p.35] 

\ h J {(Oo~ co) 2 

In this process, the atom absorbs energy E b - E a = Tuoq from the electromagnetic 
field. We say that it has “absorbed a photon” (Figure 9.4a). [As I mentioned earlier, 
the word “photon ” really belongs to quantum electrodynamics (the quantum theory 
of the electromagnetic field), whereas we are treating the field itself classically. But 
this terminology is convenient, as long as you don’t read more into it than is really 
there.] 



(a) Absorption (b) Stimulated emission (c) Spontaneous emission 

Figure 9.4: Three ways in which light interacts with atoms: (a) absorption, 

(b) stimulated emission, (c) spontaneous emission. 



308 Chap. 9 Time-Dependent Perturbation Theory 


I could, of course, go back and run the whole derivation for a system that starts 
off in the upper state [c a (0) = 0, c&(0) = 1]. Do it for yourself, if you like; it comes 
out exactly the same —except that this time we’re calculating P^ a — |c a (f)l 2 , 
probability of a transition down to the lower level: 

= [9.36] 

\ ft ) (coo - CO) 

(It has to come out this way—all we’re doing is switching a *+ b, which substitutes 
—co o for ojq. When we get to Equation 9.25 we keep the first term, with —coq + co in 
the denominator, and the rest is the same as before.) But when you stop to think of it. 
this is an absolutely astonishing result: If the particle is in the upper state, and you 
shine light on it, it can make a transition to the lower state, and in fact the probability 
of such a transition is exactly the same as for a transition upward from the lower state. 
This process, which was first discovered by Einstein, is called stimulated emission. 

In the case of stimulated emission the electromagnetic field gains energy hcoo 
from the atom; we say that one photon went in and two photons came out—the original 
one that caused the transition plus another one from the transition itself (Figure 9.4b). 
This raises the possibility of amplification , for if I could obtain a bottle of atoms, all 
in the upper state, and trigger it with a single incident photon, a chain reaction would 
occur, with the first photon producing 2, and these 2 producing 4, and so on. We’d 
have an enormous number of photons coming out, all with the same frequency and 
at virtually the same instant. This is, of course, the principle behind the laser (fight 
amplification by stimulated emission of radiation). Note that it is essential (for laser 
action) to get a majority of the atoms into the upper state (a so-called population 
inversion), because absorption (which costs one photon) competes with stimulated 
emission (which produces one); if you started with an even mixture of the two states, 
you’d get no amplification at all. 

There is a third mechanism (in addition to absorption and stimulated emission) 
by which radiation interacts with matter; it is called spontaneous emission. Here an 
atom in the excited state makes a transition downward, with the release of a photon but 
without any applied electromagnetic field to initiate the process (Figure 9.4c). This is 
the mechanism that accounts for the normal decay of an atomic excited state. At first 
sight it is far from clear why spontaneous emission should occur at all. If the atom 
is in a stationary state (albeit an excited one), and there is no external perturbation, 
it should just sit there forever. And so it would , if it were really free of all external 
perturbations. However, in quantum electrodynamics the fields are nonzero even in 
the ground state —just as the harmonic oscillator (for example) has nonzero energy 
(to wit, hco/2) in its ground state. You can turn out all the lights, and cool the room 
down to absolute zero, but there is still some electromagnetic radiation present, and 
it is this “zero-point” radiation that serves to catalyze spontaneous emission. When 
you come right down to it, there is really no such thing as truly spontaneous emission; 
it’s all stimulated emission. The only distinction to be made is whether the field that 
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does the stimulating is one that you put there, or one that God put there. In this sense 
it is exactly the reverse of the classical radiative process, in which it’s all spontaneous 
and there is no such thing as stimulated emission. 

Quantum electrodynamics is beyond the scope of this book, 6 but there is a lovely 
argument due to Einstein 7 which interrelates the three processes (absorption, stimu¬ 
lated emission, and spontaneous emission). Einstein did not identify the mechanism 
responsible for spontaneous emission (perturbation by the ground-state electromag¬ 
netic field), but his results nevertheless enable us to calculate the spontaneous emission 
rate, and from that the natural lifetime of an excited atomic state. Before we turn to 
that, however, we need to consider the response of an atom to non-monochromatic, 
unpolarized, incoherent electromagnetic waves coming in from all directions—such 
as it would encounter, for instance, if it were immersed in thermal radiation. 

9.2.3 Incoherent Perturbations 


The energy density in an electromagnetic wave is* 


u = 


[9.37] 


where Eq is (as before) the amplitude of the electric field. So the transition probability 
(Equation 9.36) is (not surprisingly) proportional to the energy density of the fields: 


2 u 2 sin 2 [(oj 0 — (d)tfl\ 

6o h {(OQ — to) 2 


[9.38] 


But this is for a monochromatic perturbation, consisting of a single frequency to. In 
many applications the system is exposed to electromagnetic waves at a whole range 
of frequencies; in that case u ->■ p(co)dco , where p{(d)dto is the energy density in the 


6 For an especially nice treatment, see Rodney Loudon, The Quantum Theory of Light , 2nd ed. 
(Oxford: Clarendon Press, 1983). 

7 Einstein’s paper was published in 1917, well before the Schrodinger equation. Quantum electro¬ 
dynamics comes into the argument via the Planck blackbody formula (Equation 5.112), which dates from 
1900. 

8 See, for example, D. Halliday and R. Resnick, Fundamentals of Physics, 3rd ed., extended (New 
York: John Wiley & Sons, 1988), Section 38-5. In general, the energy per unit volume in electromagnetic 
fields is 


u = (Co /2)E 2 + (1/2/xo )B 2 . 

For electromagnetic waves, the electric and magnetic contributions are equal, so 
u = co E 2 = cqEq cos 2 (cot), 

and the average over a full cycle is (cq/2 )E 2 , since the average of cos 2 (or sin 2 ) is 1/2. 
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frequency range dco, and the net transition probability takes the form of an integral: 9 


Pb^ait) = -^IPI 2 f 

€0tl JO 


p(co) 


sin 2 [(ct*o ~ co)t/2] 


(coo - CO ) 2 


dco. 


[9.39] 


Ordinarily, the term in curly brackets is sharply peaked about coo (Figure 9.2), 
whereas p{co) is relatively broad; in that case we may as well replace p{co ) by p{coo) 
and take it outside the integral: 


Pb^a(t) = —-irpicoo) 

€oh Jo 


r 


sin 2 [(a>o — co)tf 2] 
(COq - co) 2 


dco. 


[9.40] 


Changing variables to x = (euo“<w)t/2, extending the limits of integration tox = ±oo 
(since the integrand is essentially zero out there anyway), and looking up the definite 


integral 

sin 2 x 

1 —dx = n, 

[9.41] 

we find 

J -oo 

P b ^a(0 = ^~j-p{C0 Q )t. 

€ott 

[9.42] 


This time the transition probability is proportional to t. The bizarre “flopping” phe¬ 
nomenon characteristic of a monochromatic perturbation gets “washed out” when we 
hit the system with an incoherent spread of frequencies. In particular, the transition 
rate ( R = dP/dt ) is now a constant : 


R b ^ = t 9 - 43 J 

e 0 £ 2 

So far, we have assumed that the perturbing wave is coming in along the 
x-direction (Figure 9.3) and polarized in the 2 -direction. But we shall be interested 
in the case of an atom bathed in radiation coming from all directions, and with all 
possible polarizations; the energy in the fields [p(co)\ is shared equally among these 
different modes. What we need, in place of \p\ 2 , is the average of | n * p| 2 , where 

p = q{ifb\r\i// a ) [9.44] 

(generalizing Equation 9.33), and the average is over both polarizations ( n ) and over 
all incident directions. This averaging can be carried out as follows. 


9 Equation 9.39 assumes that the perturbations at different frequencies are independent, so that the 
total transition probability is a sum of the individual probabilities. If the different components are coherent 
(phase correlated), then we should add amplitudes [ci(t)], not probabilities i\cbU)\ 2 ), and there will be 
cross-terms. For the applications we will consider the perturbations are always incoherent. 
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Polarization: For propagation in the z-direction, the two possible polarizations 
are t and /, so the polarization average (subscript p ) is 

(» • Pf p = P) 2 + 0 ' P) 2 ] = \(Pl + P 2 ) = \fP 2 sin2 E9.45] 

where 9 is the angle between p and the direction of propagation. 

Propagation direction: Now let’s set the polar axis along p and integrate over 
all propagation directions to get the polarization-propagation average (subscript pp ): 

(n • p) 2 pp = ^ j ^p 2 sin 2 sin 9 dO d<f> - J sin 3 9 d9 = [9.46] 


So the transition rate for stimulated emission from state b to state a, under the 
influence of incoherent, unpolarized light incident from all directions, is 



[9.47] 


where p is the matrix element of the electric dipole moment between the two states 
(Equation 9.44) and p(co o) is the energy density in the fields, per unit frequency, 
evaluated at coq — (E b - E a )/ti. Ui 


9.3 SPONTANEOUS EMISSION _ 

9.3.1 Einstein’s A and B coefficients 

Picture a container of atoms, N a of them in the lower state (^ a ), and N b of them in 
the upper state (^). Let A be the spontaneous emission rate, 11 so that the number of 
particles leaving the upper state by this process, per unit time, is N b A. n The transition 
rate for stimulated emission, as we have seen (Equation 9.47), is proportional to 
the energy density of the electromagnetic field—call it B ba p(coo). The number of 
particles leaving the upper state by this mechanism, per unit time, is N b B ba p(a> 0 ). 
The absorption rate is likewise proportional to p(coo )—call it B ab p(co o); the number 
of particles per unit time joining the upper level is therefore N a B ab p{(o o). All told, 
then, 

^ = —N b A - N b B ba p(a>o) + N a B ab p(a>o). [9.48] 


10 This is a special case of Fermi’s Golden Rule for time-dependent perturbation theory, 

11 Normally I’d use R for a transition rate, but out of deference to derAlte everyone follows Einstein ’ s 
notation in this context. 

12 Assume that N a and N b are very large, so we can treat them as continuous functions of time, and 
ignore statistical fluctuations. 
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Suppose that these atoms are in thermal equilibrium with the ambient field, so 
that the number of particles in each level is constant . In that case dN b /dt = 0, and 
it follows that 

P( " 0) = (N a /N b )B ab - B ba ' [9 ' 49i 

On the other hand, we know from elementary statistical mechanics 13 that the number 
of particles with energy E , in thermal equilibrium at temperature T, is proportional 
to the Boltzmann factor, exp(-E/k B T), so 


N a 

aT 


e ~E a /k B T 
g — Eyj kgT 


h(oo/k B T 


and hence 


pifo o) = ■ 


[9.50] 


[9.51] 


e h m /k B T Bab _ Bba • 

But Planck’s blackbody formula Equation 5.112 tells us the energy density of 
thermal radiation: 

^ = j ; [9,52] 

comparing the two expressions, we conclude that 


Bab — Bb a 


and 


ufh 


,B ba . 


[9.53] 


[9.54] 


Equation 9.53 confirms what we already knew: that the transition rate for stimulated 
emission is the same as for absorption. But it was an astonishing result in 1917— 
indeed, Einstein was forced to “invent” stimulated emission in order to reproduce 
Planck’s formula. Our present attention, however, focuses on Equation 9.54, for this 
tells us the spontaneous emission rate (A )—which is what we are looking for—in 
terms of the stimulated emission rate [Bb a p(coo )]—which we already know. From 
Equation 9.47 we read off 


B ba '■ 


3eo» : 


IPl 


and it follows that the spontaneous emission rate is 



[9.55] 


[9.56] 


13 See, for example, Charles Kittel and Herbert Kroemer, Thermal Physics, 2nd ed. (New York: 
Freeman, 1980), Chapter 3. 
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Problem 9.8 As a mechanism for downward transitions, spontaneous emission 
competes with thermally stimulated emission (stimulated emission for which Planck 
radiation is the source). Show that at room temperature (T = 300 K) thermal stimula¬ 
tion dominates for frequencies well below 5 x 10 12 Hz, whereas spontaneous emission 
dominates for frequencies well above 5 x 10 12 Hz. Which mechanism dominates for 
visible light? 


9.3.2 The Lifetime of an Excited State 


Equation 9.56 is our fundamental result; it gives the transition rate for spontaneous 
emission. Suppose, now, that you have a bottle full of atoms, with N b (t) of them in 
the excited state. As a result of spontaneous emission, this number will decrease as 
time goes on; specifically, in a time interval di you will lose a fraction A dt of them: 

dN b = —AN b dt [9.57] 


(assuming there is no mechanism to replenish the supply). 14 Solving for N b (t), we 
find 

N„(t) = N b (0)e- A ‘-, [9.58] 

evidently the number remaining in the excited state decreases exponentially, with a 
time constant 

r = -. [9.59] 

A 

We call this the lifetime of the state—technically, it is the time it takes for N b (t) to 
reach 1 je & 0.368 of its initial value. 

I have assumed all along that there are only two states for the system, but 
this was just for notational simplicity—the spontaneous emission formula (Equa¬ 
tion 9.56) gives the transition rate for \j/ b —>• \fr u regardless of any other allowed states 
(see Problem 9.14). Typically, an excited atom has many different decay modes (that 
is, \[r b can decay to a large number of different lower-energy states, xj / ai , ^a 2 , , • ■ •) • 
In that case the transition rates add , and the net lifetime is 


1 

A x + A 2 + A 3 + • • • 


[9.60] 


Example. Suppose a charge q is attached to a spring and constrained to oscillate 
along the x-axis. Say it starts out in the state \n) (Equation 2.50) and decays by 
spontaneous emission to state \ri). From Equation 9.44, we have 

p = q(n\x\n')i. 


14 This situation is not to be confused with the case of thermal equilibrium, which we considered 
in the previous section. We assume here that the atoms have been lifted out of equilibrium, and are in the 
process of cascading back down to their equilibrium levels. 
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You calculated the matrix elements of x back in Problem 3.50: 

(n\x\n ! ) = J -^z(V« 7 3 n ,„'-i + 1 ), 

V 2 mco 

where co is the natural frequency of the oscillator (I use the overbar to distinguish it 
from the frequency of the emitted radiation, although as we’ll see in a moment the 
two turn out to be equal, and at that point I’ll drop the bar). But we’re talking about 
emission, so n' must be lower than n; for our purposes, then, 


Evidently transitions occur only to states one step lower on the “ladder,” and the 
frequency of the photon emitted is 


E n — E n > (i n + 1 /2)hco — («' + 1 /2)ha> 


■ (n - n')7o = cd. [9.62] 


Not surprisingly, the system radiates at the classical oscillator frequency. The transi¬ 
tion rate (Equation 9.56) is 

2 2 

4 nq co 

A = [9 - 63) 
and the lifetime of the n 111 stationary state is 


Meanwhile, each radiated photon carries an energy hco, so the power radiated is Atico: 
q 2 co 2 

P - -j (nhco), 

oneomc* 

or, since the energy of an oscillator in the n th state is E = (n -j- 1 /2 )hco, 

p =6 19651 

This is the average power radiated by a quantum oscillator with (initial) energy E. 

For comparison, let’s determine the average power radiated by a classical os¬ 
cillator with the same energy. According to classical electrodynamics, the power 
radiated by an accelerating charge q is given by the Larmor formula: 15 


15 See, for example, David J. Griffiths Introduction to Electrodynamics , 2nd ed. (Englewood Cliffs. 
NJ: Prentice-Hall, 1989), Section 9.1.4. 
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For a harmonic oscillator with amplitude jco, x(0 = xo cos(a>0> and the acceleration 
is a = —xqco 2 cos (cot). Averaging over a full cycle, then, 


\2tT€(,C 3 ‘ 

But the energy of the oscillator is E = (1/2 )mco 2 XQ, so Xq = 2E/mco 2 , and hence 


P = 


6n€omc 3 


E. 


[9.67] 


This is the average power radiated by a classical oscillator with energy E. In the 
classical limit ( 7i 0) the classical and quantum formulas agree 16 ; however, the 

quantum formula (Equation 9.65) protects the ground state: If E — (1/2 )hco the 
oscillator does not radiate. 


Problem 9.9 The half-life (fi/ 2 ) of an excited state is the time it would take for 
half the atoms in a large sample to make a transition. Find the relation between t \/2 
and t (the “lifetime” of the state). 

***Problem 9.10 Calculate the lifetime (in seconds ) for each of the four n = 2 states 
of hydrogen. Hint: You’ll need to evaluate matrix elements of the form {i/^ioo|x | if 200 ), 
(^ioolTl^ 2 ii), and so on. Remember that x = r sin# cos 0, y = r sin# sin0, and 
z = r cos 0. Most of these integrals are zero, so scan them before you start calculating. 
Answer : 1.60 x 10 -9 seconds for all except ^ 200 ? which is infinite. 


9.3.3 Selection Rules 

The calculation of spontaneous emission rates has been reduced to a matter of eval¬ 
uating matrix elements of the form 


(fb\r\^a)- 

As you will have discovered if you worked Problem 9.10 (if you didn’t, go back right 
now and do so!), these quantities are very often zero, and it would be helpful to know 
in advance when this is going to happen, so we don’t waste a lot of time evaluating 
unnecessary integrals. Suppose we are interested in systems like hydrogen, for which 
the Hamiltonian is spherically symmetrical. In that case we may specify the states 
with the usual quantum numbers «, /, and m, and the matrix elements are 

(n r l'm f \r\nlm). 


16 In fact, if we express P in terms of the energy above the ground state, the two formulas are 
identical. 
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Clever exploitation of the angular momentum commutation relations and the her- 
miticity of the angular momentum operators yields a set of powerful constraints on 
this quantity. 


Selection rules involving m and m f : Consider first the commutators of L z with 
x, y, and z, which we worked out in Chapter 4 (see Equation 4.122): 

[L z , x] = ihy, [L z , y ] = -ihx, [L z , z ] = 0. [9.68] 

From the third of these it follows that 

0= (n'l'm r \[L z , z]\nlm) = (n'l'm'KLzZ - zL z )\nlm) 

— {ril'm'\[{mh)z — z(mti)]\nlm) = (m' — m)^(«7 / m / |zj«/m). 

Conclusion: 

Either m' = m, or else {n'Vm'\z\nlm) — 0. [9.69] 

So unless m' = m, the matrix elements of z are always zero. 

Meanwhile, from the commutator of L z with x we get 

{n f l'm'\[L z , x]\nlm) = (nT f m f \(L z x — xL z )\nlm) 

= (m' — m)h(n f Tm r \x\nlm ) — i?z(n7 / m / |y]n/m). 

Conclusion: 

(m — m){n , Tm , \x\nlm) — i {nT'm'\y\nlm) [9.70] 

So you never have to compute matrix elements of y —you can always get them from 
the corresponding matrix elements of x. 

Finally, the commutator of L z with y yields 

{nT'm'\[L z , y]\nlm) = {n ! l'm'\{L z y — yL z )\nlm) 

— (m — m)ti{ril'm'\y\nlm) — —iTi(ril ! m\x\nlm). 

Conclusion: 

fra' — m){nT'm'\y\nlm) = — i {n'l'm'\x\nlm) [9.71] 

In particular, combining Equations 9.70 and 9.71, 

(m' — m) 2 {n’l 1 m*\x\nlm) = i(m f — m)(n'l'm'\y\nlm) = (n ! l'm \x\nlm), 
and hence 

either (m 1 — m) 2 = 1, or else {n f l'm*\x\nlm) = {riVm'\y\nlm) = 0. [9.72] 
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From Equations 9.69 and 9.72 we obtain the selection rule for m: 

No transitions occur unless Am = ±1 or 0. [9.73] 

This is an easy result to understand if you remember that the photon carries 
spin 1, and hence its value of m is 1, 0, or — l 17 ; conservation of (the 
z-component of) angular momentum requires that the atom give up whatever the 
photon takes away. 

Selection rules involving l and V: In Problem 9.11 you are asked to derive the 
following commutation relation: 

[L 2 , [L 2 , r]] = 2h 2 (rL 2 + L 2 r). [9.74] 

As before, we sandwich this commutator between ( n'l'm r \ and | nlm) to derive the 
selection rule: 

{nl'm | [L 2 , [L 2 , r]] | nlm) = 2h 2 {n Vm \(rL 2 + L 2 r)\nlm) 

= 2+ 1) + V{1' + l)](n7W|r|n/m) = <n7W|(L 2 [L 2 , r] - [L 2 , r]L 2 )\nlm) 

= h 2 [l'(l ' + 1) - /(/ + l)](nTm'\[L 2 , r]| nlm) 

= Ti 2 [l'(l' + 1) - /(/ + \)]{n l'm \{L 2 r - rL 2 )|n/m) 

= h A [V(V + 1) - /(/ + l)] 2 (n7W|r|n/m). 

Conclusion: 

Either 2[/(/ + 1) + l\l’ + 1)] - [/'(/' + 1) - /(/ + l)] 2 
or else (n7W|r|«^”) = 0. 

But 

[/'(/' +1)-/(/ +1)] = (/' + / + l)(/'-/) 

and 

2[i(i + 1 ) + i\r + 1 )] = (/' + / + 1) 2 + (i r — /) 2 — i , 

so the first condition in Equation 9.76 can be written 

[(/' + / + l) 2 -l][(/'-/) 2 -l] = 0. 

17 When the polar axis is along the direction of propagation, the middle value does not occur, and 
if you are only interested in the number of linearly independent photon states, the answer is 2, not 3. 
However, in this case the photon need not be going in the ^-direction, and all three values are possible. 


[9.75] 


[9.76) 


[9.77] 
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1=0 1 = 1 1=2 1=3 

n = 4 


n = 3 


n = 2 


n = 1 

Figure 9.5: Allowed decays for the first four Bohr levels in hydrogen. 

The first factor cannot be zero (unless /' = l = 0—this loophole is closed in Prob¬ 
lem 9.12), so the condition simplifies to l 1 = l ± 1. Thus we obtain the selection rule 
for/: 

No transitions occur unless A/ = ±1. [9.78] 

Again, this result (though scarcely trivial to derive) is easy to interpret : The photon 
carries spin 1, so the rules for addition of angular momentum would allow /' = / + 1, 
V = l, or T = l — 1 (for electric dipole radiation the middle possibility—though 
permitted by conservation of angular momentum—does not occur). 

Evidently not all transitions to lower-energy states can proceed by spontaneous 
emission; some are forbidden by the selection rules. The scheme of allowed transi¬ 
tions for the first four Bohr levels in hydrogen is shown in Figure 9.5. Note that the 
2 S state (i/^oo) is “stuck”: It cannot decay, because there is no lower-energy state 
with / = 1. It is called a metastable state, and its lifetime is indeed much longer 
than that of, for example, the IP states ( 1 /^ 211 , ^ 210 , and ^ 21 - 1 )- Metastable states do 
eventually decay, either by collisions or by what are (misleadingly) called forbidden 
transitions (Problem 9.20), or by multiphoton emission. 

** Problem 9.11 Prove the commutation relation Equation 9.74. Hint: First show 
that 

[L 2 , z] = 2ih(xL y — yL x — ihz). 

Use this and the fact that r • L = r • (r x p) = 0 to demonstrate that 
[L 2 ,[L 2 ,z]]=2h 2 (zL 2 + L 2 z). 

The generalization from z to r is trivial. 
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Problem 9.12 Plug the “loophole” in Equation 9.78 by showing that if /' = / = 0 
then {n f Vm'\r\nlm) = 0. 

** Problem 9.13 An electron in the n = 3, / = 0, m = 0 state of hydrogen decays 
by a sequence of (electric dipole) transitions to the ground state. 

(a) What decay routes are open to it? Specify them in the following way: 

1300) -> \nlm) -» | ril'm’) |100). 

(b) If you had a bottle full of atoms in this state, what fraction of them would decay 
via each route? 

(c) What is the lifetime of this state? Hint : Once it’s made the first transition, it’s 
no longer in the state |300), so only the first step in each sequence is relevant 
in computing the lifetime. When there is more than one decay route open, the 
transition rates add. 


FURTHER PROBLEMS FOR CHAPTER 9 


** Problem 9.14 Develop time-dependent perturbation theory for a multilevel sys¬ 
tem, starting with the generalization of Equations 9.1 and 9.2: 



Hofin = E n fi n , {tj/nlfim) = 

[9.79] 

At time t — 0 we turn on a perturbation H'(t), so that the total Hamiltonian is 


H = Ho + H'(t). 

[9.80] 

(a) Generalize Equation 9.6 to read 



'p(0 = 

[9.81] 

and show that 

c m = - i ~Y, c « H L‘ w -- EMh - 

[9.82] 

where 

H' mn = Itf'IW- 

[9.83] 


(b) If the system starts out in the state fin, show that (in first-order perturbation 
theory) 

Cn (0 = 1 — T f 

ft Jo 


[9.84] 
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and 


c m (t) = - 


= ~jr jf dt', (m # N). 


[9.85] 


(c) For example, suppose H' is constant (except that it was turned on at t = 0 and 
switched off again at some later time t ). Find the probability of transition from 
state N to state m (m ^ N), as a function of t. Answer: 


AiZJf ,2 sin 2 [(^ - P m )tf2h] 
- (EV- E m ) 2 - 


[9.86] 


(d) Now suppose H ! is a sinusoidal function of time: H' — V cos (tot). Making 
the usual assumptions, show that transitions occur only to states with energy 
E m = E N ± Tico , and the transition probability is 


Pn- 


= \VmN \ 2 


sin 2 [(isV — E m ±Ti(o)t/2h] 
(E n — E m ±h(t>) 2 


[9.87] 


(e) Suppose a multilevel system is immersed in incoherent electromagnetic radia¬ 
tion. Using Section 9.2.3 as a guide, show that the transition rate for stimulated 
emission is given by the same formula (Equation 9.47) as for a two-level system. 

Problem 9.15 For the examples in Problem 9.14 (c) and (d), calculate c m (t), to 
first order. Check the normalization condition: 

£km(0l 2 = 1, [9.88] 


and comment on any discrepancy. Suppose you wanted to calculate the probabil¬ 
ity of remaining in the original state i/^v; would you do better to use \c N (t)\ 2 , or 

1 - \c m (t)\ 2 l 

Problem 9.16 A particle starts out (at time t = 0) in the N th state of the infinite 
square well. Now water leaks into the well, and then drains out again, so that the 
bottom is at uniform potential Vq (f), with F 0 (0) = Tq(T) = 0. 

(a) Solve the exact equation (Equation 9.82) for c m (t ) , and show that the wave 
function changes phase, but no transitions to other states occur. Find the phase 
change <p(T) in terms of the function V 0 (t). 

(b) Analyze the same problem in first-order perturbation theory, and compare your 
answers. 


Note: The same result holds whenever the perturbation simply adds a constant 
(constant in x, that is, not in t) to the potential; it has nothing to do with the infinite 
square well as such. See Problem 1.13. 
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♦Problem 9.17 A particle of mass m is initially in the ground state of the (one¬ 
dimensional) infinite square well. At time t = 0 a “brick” is dropped into the well, 
so that the potential becomes 

V 0 , if 0 < x < a/2, 

V (x) = 0, if a/2 < x < a, 

. oo, otherwise, 

where Vo E\. After a time T, the brick is removed, and the energy of the particle 
is measured. Find the probability (in first-order perturbation theory) that the energy 
is now Ei. 

Problem 9.18 Justify the following version of the energy-time uncertainty prin¬ 
ciple (due to Landau): AEAt > Ti/2, where At is the time it takes to execute a 
transition involving an energy change A E, under the influence of a constant pertur¬ 
bation (see Problem 9.14c.) Explain more precisely what A E and At mean in this 
context. 

♦♦♦Problem 9.19 An electron is at rest at the origin, in the presence of a magnetic 
field whose magnitude (Bo) is constant but whose direction rides around at constant 
angular velocity u> on the lip of a cone of opening angle a: 

B(t) = i?o[sina cos(cot)i + sina sin (cot)] + cosa£]. [9.89] 


(a) Construct the 2 x 2 Hamiltonian matrix (Equation 4.158) for this system. 

(b) Find the exact solution to the (time-dependent) Schrodinger equation, assuming 
the particle starts out with spin up. Hint: You can do it from scratch, or by 
noting that in this case the rotating wave approximation is exact, and refering 
to Problem 9.7. Answer. 

[cos(Af/2) + i[(a) + coi cosgO/A] sin(Af/2)] e ~ l(0t/2 
i[(co\ sina)/A] sin(Xt/2)e l<ot/2 



\ [9-90] 


co\ = —eBo/m and X = Jco 2 + col + 2coa>\ cos a. 


(c) Now treat the same problem by (first-order) time-dependent perturbation theory: 
use Equation 9.17 to calculate the (approximate) probability of a transition from 
spin up (the initial state) to spin down, as a function of time, and compare the 
exact answer (from part b). State the criterion on the strength of the field that 
determines whether perturbation theory is applicable in this case. 


♦♦♦Problem 9.20 In Equation 9.31 we assumed that the atom is so small (in compar¬ 
ison to the wavelength of light) that spatial variations in the field can be ignored. The 
true electric field would be 


E(r, t) = Eq cos(k • r - cot). 


[9.92] 
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If the atom is centered at the origin, then k r -C 1 over the relevant volume 
(|k| = 2jt/A, so k • r ~ r/X <^C 1), and that’s why we could afford to drop this 
term. Suppose we keep the first-order correction: 

E(r, t) = E 0 [cos(tur) + (k • r) sin(a>f)]. [9.93] 

The first term gives rise to the allowed (electric dipole) transitions we considered 
in the text; the second gives rise to so-called forbidden (magnetic dipole and elec¬ 
tric quadrupole) transitions (higher powers of k - r lead to even more “forbidden” 
transitions, associated with higher multipole moments 18 ). 

(a) Obtain the spontaneous emission rate for forbidden transitions (don’t bother to 
average over polarization and propagation directions, though this should really 
be done to complete the calculation). Answer. 

2 5 

Rb ^a - l(a|(« * r)(k • r)|£>)| 2 . [9.94] 


(b) Show that for a one-dimensional oscillator, the forbidden transitions go from 
level n to level n — 2, and the transition rate (suitably averaged over h and k) is 


hq 2 co 3 n(n — 1) 
\57t€Qm 2 c 5 


[9.95] 


Find the ratio of the “forbidden” rate to the “allowed” rate, and comment on the 
terminology. (Note: w is the frequency of the photon , not the oscillator.) 

(c) Show that the 2S -» IS transition in hydrogen is not possible even by a “for¬ 
bidden” transition. (As it turns out, this is true for all the higher multipoles as 
well; the dominant decay is in fact by a two-photon emission, and the lifetime 
is about a tenth of a second. 19 ) 

Problem 9.21 We have encountered stimulated emission, (stimulated) absorption, 
and spontaneous emission ... how come there’s no such thing as spontaneous ab¬ 
sorption1 


18 For a systematic treatment (including the role of the magnetic field), see David Park, Introduction 
to the Quantum Theory , 3rd ed. ( New York: McGraw-Hill, 1992), Chapter 11. 

19 See Masataka Mizushima, Quantum Mechanics of Atomic Spectra and Atomic Structure, New 
York: Benjamin, 1970), Section 5.6. 



CHAPTER 10 


THE ADIABATIC 
APPROXIMATION 


10.1 THE ADIABATIC THEOREM 


10.1.1 Adiabatic Processes 

Imagine a perfect pendulum, with no friction or air resistance, oscillating back and 
forth in a vertical plane. If I grab the support and shake it in a jerky manner, the bob 
will swing around in a wild chaotic fashion. But if I very gently and steadily move the 
support (Figure 10.1), the pendulum will continue to swing in a nice, smooth way, in 
the same plane (or one parallel to it) with the same amplitude. This gradual change 
in the external conditions characterizes an adiabatic process. Notice that there are 
two characteristic times involved: 7), the “internal” time, representing the motion of 
the system itself (in this case the period of the pendulum’s oscillations), and T e , the 
“external” time, over which the parameters of the system change appreciably (if the 
pendulum were mounted on an oscillating platform, for example, T e would be the 
period of the platform’s motion). An adiabatic process is one for which T e » 7). 1 

The basic strategy for analyzing an adiabatic process is first to solve the prob¬ 
lem with the external parameters held fixed, and only at the end of the calculation 
allow them to change with time. For example, the classical period of a pendulum 
of (constant) length L is 2 jr*JL/g; if the length is now gradually changing, the 


'For an interesting discussion of classical adiabatic processes, see Frank S. Crawford, Am. J. Phys. 
58, 337 (1990). 
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Figure 10.1: Adiabatic motion: If the 
case is transported very gradually, the 
pendulum inside keeps swinging with the 
same amplitude, in a plane parallel to the 
original one. 


period will presumably be 2 n yjL(t)/g . When you stop to think about it, we actually 
use the adiabatic approximation (implicitly) all the time without noticing it. A case 
in point is our discussion of the hydrogen molecule ion (Section 7.3). We began 
by assuming that the nuclei were at rest, a fixed distance R apart, and we solved 
for the motion of the electron. Once we had found the ground state energy of the 
system as a function of R, we located the equilibrium separation and from the cur¬ 
vature of the graph we obtained the frequency of vibration of the nuclei (Problem 
7.10). In molecular physics this technique (beginning with nuclei at rest, calculating 
electronic wave functions, and using these to obtain information about the positions 
and—relatively sluggish—motion of the nuclei) is known as the Born-Oppenheimer 
approximation. 

In quantum mechanics, the essential content of the adiabatic approximation 
can be cast in the form of a theorem. Suppose that the Hamiltonian changes gradually 
from some initial form H 1 to some final form H ! (Figure 10.2). The adiabatic 
theorem states that if the particle was initially in the nth eigenstate of H l , it will be 


H‘ 



7 


t 


Figure 10.2: A model for adiabatic change in the Hamiltonian, from H l to H f . 
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A 

t | 

[ 

k\|/(x) 


k ¥ -(x) 


2a x a 2a x a 2a x 


(a) (b) (°) 

Figure 10.3: (a) Particle starts out in the ground state of the infinite square well. 

(b) If the wall moves slowly, the particle remains in the ground state, (c) If the 
wall moves rapidly, the particle is left (momentarily) in its initial state. 

carried (under the Schrodinger equation) into the nth eigenstate of H f . (I assume 
that the spectrum is discrete and nondegenerate throughout the transition from H to 
Hf, so there is no ambiguity about the ordering of the states; these conditions can 
be relaxed, given a suitable procedure for “tracking” the eigenfunctions, but I’m not 
going to pursue that here.) 

For example, suppose we prepare a particle in the ground state of the infinite 
square well (Figure 10.3a): 

*'(*> =[ 10 . 1 ] 

If we now gradually move the right wall out to 2a, the adiabatic theorem says that the 
particle will end up in the ground state of the expanded well (Figure 10.3b): 

= m2] 

(apart, perhaps, from a phase factor). Notice that we’re not talking about a small 
change in the Hamiltonian (as in perturbation theory)—this one is a huge change. All 
we require is that it happen slowly. By contrast, if the well expands suddenly, the 
resulting state is still ^(x) (Figure 10.3c), which is a complicated linear combination 
of eigenstates of the new Hamiltonian (Problem 3.48). 

***Problem 10.1 The case of an infinite square well whose right wall expands at a 
constant velocity (v) can be solved exactly. 2 A complete set of solutions is 

4>„(*. 0 = Jf sin (— x) e-W^ilaO/a* [10.3] 

V kj \ w / 

where w(t) = a+viisthe width of the well and = n 2 7 i 2 h 2 / 2 ma 2 isthentha\lowed 


2 S. W. Doescher and M. H. Rice, Am. J. Phys. 37, 1246 (1969). 
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energy of the original well (width a). The general solution is a linear combination 
of the d>’s: 


OO 

*I>(x, t ) = c„d>„(x, t ); 


[10.4] 


the coefficients c n are independent of t. 

(a) Check that Equation 10.3 satisfies the time-dependent Schrodinger equation, 
with the appropriate boundary conditions. 

(b) Suppose a particle starts out {t = 0) in the ground state of the initial well: 

*(*,°) = y?sin(^). 


Show that the expansion coefficients can be written in the form 


c, 



sin(nz) sin(z) dz. 


[10.5] 


where a = mva/2n 2 h is a dimensionless measure of the speed with which 
the well expands. (Unfortunately, this integral cannot be evaluated in terms of 
elementary functions.) 

(c) Suppose we allow the well to expand to twice its original width, so the “exter¬ 
nal” time is given by w(T e ) = 2a. The “internal” time is the period of the 
time-dependent exponential factor in the (initial) ground state. Determine 
T e and 7}, and show that the adiabatic regime corresponds to a <<C 1, so that 
e~ !az = 1 over the domain of integration. Use this to determine the expansion 
coefficients c n . Construct 4>(x, t), and confirm that it is consistent with the 
adiabatic theorem. 

(d) Show that the phase factor in 'Tlx, t) can be written in the form 

*(0 = 4 /^', [ 10 . 6 ] 

where E n (t) = n 2 n 2 h 2 f 2m w 2 is the n th instantaneous eigenvalue, at time t. 
Comment on this result. 
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10.1.2 Proof of the Adiabatic Theorem 


The adiabatic theorem is simple to state, and it sounds plausible, but it is not easy 
to prove. 3 Suppose the time-dependent part of the Hamiltonian can be written in the 


form 4 


H\t) = Vf(t), 


[10.7] 


where f(t) is a function that starts out zero (at t = 0) and increases to 1 (at t = T), 
Figure 10.4. Assume that the particle starts out in the n th eigenstate of the original 
Hamiltonian: 

ip(0) = i/d [10.8] 


and evolves into some state TOO- Our problem is to show that if the function fit) rises 
very gradually, then the probability that the particle, at time T, is in the nth eigenstate 
of the final Hamiltonian {ffl ) is 1. More precisely, we must demonstrate that 


\mT)wl)\ 2 = o' 


if m = n, 
if m / n. 


[10.9] 


(Of course, if the first of these is true, the second has to be, and vice versa. But it is 
not clear at this stage which condition will be easier to prove.) 

Assume for the moment that V is small, so we can use first-order time- 
independent perturbation theory to determine ffL- From Equation 6.12, 

f = fm + ^ km r Vfr* [10.10] 

k^m 



3 The adiabatic theorem is usually attributed to Ehrenfest, who studied adiabatic processes in early 
versions of the quantum theory. The first proof in modem quantum mechanics was given by Bom and 
Fock, Zeit. f. Physik 51, 165 (1928), Other proofs will be found in Messiah, Quantum Mechanics (New 
York: John Wiley & Sons, 1962), Vol. II, Chapter XVII, Section 12, and J-T. Hwang and Philip Pechukas, 
J. Chem. Phys. 67,4640 (1977). The argument given here is suggested by Gasiorowicz, Quantum Physics 
(New York: John Wiley & Sons, 1974), Chapter 22, Problem 6. 

4 The assumption that H' is the product of an operator ( V ) and a (real) function of t is not necessary 
for the theorem itself, but it does make the proof less cumbersome. In Section 10.1.3 we will encounter a 
case in which the different matrix elements of H’ have different (complex) time dependences. As long as 
the adiabatic approximation (in the form of Equation 10.15) holds for each of them, the adiabatic theorem 
itself is valid. 
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where 

Vkm = (ir k \V\fm). [10.11] 

(To simplify the notation, I’ll drop the superscript i on eigenfunctions and eigenvalues 
of the initial Hamiltonian; these are the “unperturbed” states for the problem.) 

Meanwhile, we use first-order time-dependent perturbation theory to determine 
ty(T). From Equation 9.81, 


^(0 = J2 c l ( t )^ l e~ iE,t/ \ [ 10 . 12 ] 


where (Equation 9.84) 


and (Equation 9.85) 


C n {t)~ l- T Vn 

n 



[10.13] 


q(/) = -V,„ //n. [10.14] 

« Jo 

This last integral can be evaluated using integration by parts. Note that 

i(Ei-E H )t r /h _ ~ lfl d [ m- £„)/'/*] 

El - E n dt' L J ’ 

so 

[I dropped the lower limit in the first term, because /(0) — 0.] Now comes the 
adiabatic approximation: We want /(f) to be a very gradual function, so that df/dt 
is extremely small. Specifically, we assume that 


d A«\h^A f , 

dt h 


[10.15] 


then the last term makes a negligible contribution to c { (t), and we conclude that 


V(T) = 


Vi V nn A\ „ V ln 


-Jri 


[10.16] 


where A is the area under the graph of /(f), from 0 to T. 
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Putting together Equations 10.10 and 10.16, and exploiting the orthonormality 
of the initial eigenfunctions, we find that 


(W/) = 


l + l- 


V nn A 




i^hi 2 

(£„ - E k ) 2 


e iE,T/t 


[10.17] 


while, for m ^ n. 




1 + /- 


V nn A 


} 


Vnm 

E m - E n 


V nk V km 


E m - E n 
lAVnnVnm 


, {En - E h )(E m - E k ) 


V nk V km 


h(E m - En) 


, {En - E k ){E m - E k ) 


e iE 


[10.18] 


But wait: These wave functions were only accurate to first order in V , so the second- 
order terms in Equations 10.17 and 10.18 are spurious (we have already thrown away 
quantities of comparable size). To first order, we have 


[10.19] 


It follows that 

|(^(T)|^/)| 2 = 1, [10.20] 

while (for m ^ n) 

|{'I'(T)|^)| 2 =0. [10.21] 

Ostensibly, either of these would suffice to establish the desired result (Equa¬ 
tion 10.9). However, Equation 10.20 is only accurate to first order (in V ), whereas 
Equation 10.21 is accurate to second order (and for that matter to third order as 
well). 5 In truth, Equation 10.20 tells us nothing (it would be valid also for a nonadi- 
abatic transformation); the crucial point is the cancellation of the first-order terms in 
Equation 10.18, for this tells us that there will be no transitions to other states. 6 


5 See Problem 9.15 for a discussion of the analogous situation in ordinary perturbation theory. 

6 In this context the word “ transition ” means from an eigenstate i/r' of the initial Hamiltonian 
(//' ) to a different eigenstate i//^ of the final Hamiltonian (Hi), The adiabatic theorem says that if the 
Hamiltonian changes gradually from H‘ to H- f , there will be no such transitions. By contrast, in the 
previous chapter we were always dealing with eigenstates of the same (unperturbed) Hamiltonian. At the 
end of the process the perturbation was (explicitly or implicitly) turned off, and a “transition” meant from 
one eigenstate of the unperturbed Hamiltonian to another eigenstate of the unperturbed Hamiltonian. The 
transition amplitudes were of first order in H' (Equations 9.17 and 9.85) and the transition probabilities of 
second order (for example. Equations 9.28, 9.86, and 9.87). The essence of the adiabatic theorem (as we 
shall see in the next paragraph) is that the transition amplitudes are only second order, and the transition 
probabilities fourth order in the (small) perturbation. 
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This shows that if the change in the Hamiltonian is both adiabatic and very 
small (so that first-order perturbation theory can be applied), then there will be no 
transitions. But what if the change, though gradual, is not small? In that case we 
chop the interval T into N subintervals, so that the change in the Hamiltonian during 
a single subinterval (A V) is of order V/N; if N it large, then A V is small, and we can 
apply the previous argument to each subinterval. If the transition amplitude (Equation 
10.18) were first order in the perturbation, then the total transition amplitude would 
go like 

[ 10 . 22 ] 

(N steps, each making a contribution proportional to AV ). The net result would be 
of order V, and if V is large, so too would be the transition amplitude. But in fact the 
transition amplitude is second order, so the total goes like 



In the limit as N —> oo, the transition amplitude goes to zero, regardless of the size 
of V. QED 


Problem 10.2 In the beginning of this chapter, I characterized an adiabatic process 
informally as one for which T e > 7). How is this related to the precise condition 
(Equation 10.15) required in the proof (in other words, what are T e and 7) here)? 


10.1.3 An Example 

Imagine an electron (charge — e, mass m) at rest at the origin, in the presence 
of a magnetic field whose magnitude (Bo) is constant but whose direction sweeps out 
a cone, of opening angle a, at constant angular velocity co (Figure 10.5): 

B(t) = Bo [sin a cos {cat) i + sin a sm(cot)j + cos oik. [10.24] 


The Hamiltonian (Equation 4.158) is 


H(t) = —B * S ~ ——-[sina coshwfW* + sin a sin(ct>t)0'v -t-cosaov] 
m 2m y 


( cos ot e lwt sin a 
2 l e l0}t sin a — cos a 


[10.25] 


where 


eB 0 

£Ol = -. 


m 


[10.26] 
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Figure 10.5: Magnetic field sweeps 
around on a cone, at angular velocity co, 
Equation 10.24. 


The normalized eigenspinors of H{t ) are 

x+{t) 

and 


/ cos (a/2) \ 

\e i0)t sin (a/2 )) 


x-if) ■■ 


/ sin(a/2) \ . 
y — e ,0}t cos(a/2) J ’ 


[10.27] 


[10.28] 


they represent spin up and spin down, respectively, along the instantaneous direction 
ofB(t ) (see Problem 4.31). The corresponding eigenvalues are 


F -A Wl 

£ ± -t—• 

Suppose the electron starts out with spin up, along B(0): 

v(0)=( COS( “ /2) ) 

( sin(a/2) ) 


[10.29] 


[10.30] 


The exact solution to the time-dependent Schrodinger equation is (Problem 10.3) 

I jcos(Af/2) + i sin(Af/2)j cos(a/2)cT ^ f > 2 ^ ^ 




\ |^cos(Af/2) + i sin(Ar/2)j sin(a/2)e* wf//2 


where 


A = -yj(o 2 + a> 2 + 2(oa>i cos a, 
or, writing it as a linear combination of x+ an d x~ > 
t (an + cd cos o 


[10.32] 


x(t)-- 


j cos ^ + sin e ., m/2x+(t) 

+ i ^sinasin^yjje -i "' /2 X-(0- 


[10.33] 
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Evidently the (exact) probability of a transition to spin down (along the current di¬ 
rection of B) is 

sinasin^y^j . [10.34] 

The adiabatic theorem says that this transition probability should vanish in the 
limit T e » T{, where T e is the characteristic time for changes in the Hamiltonian (in 
this case, 1/tu) and 7} is the characteristic time for changes in the wave function [in 
this case, ti/(E + — £’_) = l/oj\ |. Thus the adiabatic approximation means oj oj\\ 
The field rotates slowly, in comparison with the phase of the (unperturbed) wave 
functions. In the adiabatic regime A = oj\, and therefore 


\(x(t)\x~m 2 = \^ 


(x(Olx-(O)l 2 


OJ 

— sin a sin 
.0)1 



0, 


[10.35] 


as advertised. The magnetic field leads the electron around by its nose, with the spin 
always pointing in the direction of B. By contrast, if co co\ then X = oj, and the 
system bounces back and forth between spin up and spin down (Figure 10.6). 



Figure 10.6: Plot of the transition probability, Equation 10.34, in the nonadia- 
batic regime (« «; a>0. 


**Problem 10.3 Check that Equation 10.31 satisfies the time-dependent Schrodinger 
equation for the Hamiltonian (Equation 10.25). Note: This is the same as Problem 
9.19(b), except that now the electron starts out with spin up along B, whereas in 
Equation 9.90 it started out with spin up along z. Also confirm Equation 10.33, 
and show that the sum of the squares of the coefficients is 1, as required for proper 
normalization. 




Sec, 10.2: Berry’s Phase 333 


10.2 Berry’s Phase _ 

10.2.1 Nonholonomic Processes 

Let us return to the classical model I used (in Section 10.1.1) to develop the notion 
of an adiabatic process: a perfectly frictionless pendulum whose support is carried 
around from place to place. I claimed that as long as the motion of the support is very 
slow, compared to the period of the pendulum (so that the pendulum executes many 
oscillations before the support has moved appreciably), it will continue to swing in 
the same plane (or one parallel to it), with the same amplitude (and, of course, the 
same frequency). 

But what if I took this ideal pendulum up to the North Pole, and set it swinging— 
say, in the direction of Portland (Figure 10.7). (For the moment, I’ll pretend the earth 
is not rotating.) Very gently (that is, adiabatically), I carry it down the longitude line 
passing through Portland, and on beyond, down to the equator. At this stage it is 
swinging north-south. Now I carry it (still swinging north-south) partway around the 
equator. And finally, I carry it back up to the North Pole, along the new longitude 
line. It is clear that the pendulum will no longer be swinging in the same plane as 
it was when I set out—indeed, the new plane makes an angle © with the old one, 
where 0 is the angle between the southbound and the northbound longitude lines. 
Now © is equal to the solid angle (£2) subtended (at the center of the earth) by the 
path around which I carried the pendulum. For this path surrounds a fraction 0/2 tt 
of the northern hemisphere, so its area is A = (1 /2) (©/2tt)47t i? 2 = 0R 2 (where R 
is the radius of the earth), and hence 

© = A/R 2 = Q. [10.36] 



Figure 10.7: Itinerary for adiabatic 
transport of a pendulum on the surface 
of the earth. 
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Figure 10.8: Arbitrary path on the surface 
of a sphere, subtending a solid angle Q. 


This is a particularly nice way to express the answer, because it turns out to be 
independent of the shape of the path (Figure 10.8). 7 

Incidentally, the Foucault pendulum is an example of precisely this sort of 
adiabatic transport around a closed loop on a sphere—only this time instead of me 
carrying the pendulum around, I let the rotation of the earth do the job. The solid 
angle subtended by a latitude line # 0 (Figure 10.9) is 

£2 = J sinOdOdcj) = 2n{~ cos 6>) |^ 0 = 2 tt( 1 - cos 6q). [10.37] 

Relative to the earth (which has meanwhile turned through an angle of 2tz), the daily 
precession of the Foucault pendulum is 2tt cos 0 0 —a result that is ordinarily obtained 
by appeal to Coriolis forces in the rotating reference frame, 8 but is seen in this context 
to admit a purely geometrical interpretation. 

A system such as this, which does not return to its original state when transported 
around a closed loop, is said to be nonholonomic. (The “transport” in question need 
not involve physical motion: What we have in mind is that the external parameters of 



Figure 10.9: Path of a Foucault 
pendulum in the course of one day. 


7 You can prove this for yourself, if you are interested. Think of the circuit as being made up of tiny 
segments of great circles (geodesics on the sphere); the pendulum makes a fixed angle with each geodesic 
segment, so the net angular deviation is related to the sum of the vertex angles of the spherical polygon. 

8 See, for example, Jerry B. Marion, Classical Dynamics, 2nd ed. (New York: Academic Press, 
1970), Section 11.4. Geographers measure latitude (A.) up from the equator, rather than down from the 
pole, so cos= sink. 
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the system are changed in some fashion that eventually returns them to their initial 
values.) Nonholonomic systems are ubiquitous—in a sense, every cyclical engine is 
a nonholonomic device: At the end of each cycle the car has moved forward a bit, or 
a weight has been lifted slightly, or something. The idea has even been applied to the 
locomotion of microbes in fluids at low Reynolds number. 9 My project for the next 
section is to study the quantum mechanics of nonholonomic, adiabatic processes. The 
essential question is this: How does the final state differ from the initial state, if the 
parameters in the Hamiltonian are carried adiabatically around some closed cycle? 

10.2.2 Geometric Phase 

If the Hamiltonian is independent of time, then a particle which starts out in the nth 
eigenstate \fn(x), 

Hty„{x) = E n f n {x), 

remains in the n th eigenstate, simply picking up a phase factor: 

V n (x,t) = t n (x)e- iE ’' t ' h . [10.38] 

If the Hamiltonian changes with time, then the eigenfunctions and eigenvalues them¬ 
selves are time dependent: 


H(t)t) = E n (t)ty n {x, t ). [10.39] 

But the adiabatic theorem tells us that when H changes very gradually, a particle 
which starts out in the nth eigenstate will remain in the n lh eigenstate—picking up at 
most a time-dependent phase factor—even as the eigenfunction itself evolves. That 
is to say, 

0 = «-*/•'[10.40] 

The term ( 

0 n (t) = -- f E n (t')dt' [10.41] 

b Jo 

is known as the dynamic phase; it generalizes the “standard” factor (— E n t/h ) to the 
case where E n is a function of time. (You will have encountered dynamical phase 
factors already, if you worked Problems 9.16 and and 10.1.) Any “extra” phase, 
y„(t), is called the geometric phase. At the moment we don’t know what it is, 
or what physical significance (if any) it carries; all we can say is that the adiabatic 
theorem does not rule out such a factor, since the particle is still “in the n th eigenstate”, 
whatever the value of y„ . [More precisely, a measurement of the energy at time t would 
be certain to return the value E„(t).] Indeed, since the eigenvalue equation (Equa- 


9 The pendulum example is an application of Hannay’s angle, which is the classical analog to 
Berry’s phase. For a collection of papers on both subjects, see Alfred Shapere and Frank Wilczek, eds., 
Geometric Phases in Physics, (Singapore: World Scientific, 1989). 
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tion 10.39) and the normalization condition only determine \fj„ (x, t) up to an arbitrary 
phase, and since this arbitrary phase could in principle be chosen independently at 
each instant of time [though in practice we shall always take t) to be a smooth 
function of t], we have to allow for an arbitrary phase factor in Equation [10.40]. 10 
Notice, incidentally, that energy is not conserved here. Of course not: Whoever is 
changing the Hamiltonian is pumping energy into or out of the system. 

If we plug Equation 10.40 into the (time-dependent) Schrodinger equation, 

ih — = H(t)V, [10.42] 

dt 

there emerges a simple formula for the time development of the geometric phase: 

ffi[^V'V>'” - +i^-t„e ie -e in ] 

l dt h dt J 


= [Hir n ]e i9 ”e iy * = E n f n e idn e iy \ 


whence 


dty n dy n 

= 0 . 

dt dt 


[10.43] 

Taking the inner product with \J/ n (which I assume has been normalized), we obtain 


dyn 

dt 


l (fn I 


dt } 


[10.44] 


Now t/r n (x, t) depends on t because there is some parameter R(t) in the Hamil¬ 
tonian that is changing with time. [In Problem 10.1, R{t) would be the width of the 
infinite square well, whose right wall is expanding.] Thus 


so that 


and hence 


dfn „ Sjr„dR 
dt ~ dR dt ’ 




dfn dR 
~dR ) "dt i 


[10.45] 


Yn (0 = 


-jf 


dir n dR , . f 1 

{ ^jR ] lf dt =/ A, 


Rf Hn 


[10.46] 


where R t and Rf are the initial and final values of R(t). In particular, if the Hamil¬ 
tonian returns to its original form after time T, so that R f = R t , then y»(T) = 0 
—nothing very interesting there ! 


10 For this reason, most people assumed until quite recently that the geometric phase was of no 
conceivable physical significance. It was Michael Berry’s inspiration to realize that if you carry the 
Hamiltonian around a closed cycle , bringing it back to its original form at the end, the relative phase at the 
beginning and at the end of the process is a nonarbitrary quantity, with profound physical implications. 
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However, I assumed (in Equation 10.45) that there is only one parameter in 
the Hamiltonian that is changing. Suppose there are N of them: R\{t), R 2 {t), ..., 
in that case 


Hr. 

17 


H^dR^ HndRj Hn dR N _ dR 

dR ] dt dR 2 dt + " dR N dt _( sfn) "dt’ 


[10.47] 


where R = ( R\, R n , ..., R #), and is the gradient with respect to these parameters. 

This time we have 


r R j 

yn(t) = i J R [10.48] 


and if the Hamiltonian returns to its original form after a time T, the net geometric 
phase change is 


Yn(T)=i 


Hn\V R ir n )-dVL. 


[10.49] 


This is a line integral around a closed loop in parameter space, and it is not, in 
general, zero. Equation 10.49 was first obtained by Berry in 1984, n and y»(T) is 
called Berry’s phase. Notice that y„ (T) depends only on the path taken, not on how 
fast that path is traversed (provided, of course, that it is slow enough to validate the 
adiabatic hypothesis). By contrast, the accumulated dynamic phase, 

&n(T) = ~£ E n {t>)df, 


depends critically on the elapsed time. 

The derivation of Berry’s phase raises several questions, which I would like to 
address before turning to some examples and applications. 

1. Is y n (t) real? If it’s not, then e iy " is not a phase factor at all, but an ex¬ 
ponential factor, and the normalization of (in Equation 10.40) is lost. Since the 
time-dependent Schrodinger equation conserves probability, it must preserve nor¬ 
malization. It would be comforting to check this, by showing explicitly that Equa¬ 
tion 10.48 yields a real y n . In fact, this is very easy to do. First note that 


V R (fn\ty n ) -0 [10.50] 

(because by assumption is normalized). So 

{V R f n \it n ) + {fnV^Rfn) = {fn\^ R fnT + {^nV^Rtn) = 0 . 

Since {\jr n \SI R ^ n ) plus its complex conjugate is zero, it follows that 

(fn\^Rf n ) is imaginary, [10.51] 

and hence, by Equation 10.48, y„(t ) is real. [Incidentally, if i fr„ itself is real, then so 


n M. V. Berry, Proc. R. Soc. Lond. A 392, 45 (1984), reprinted in Shapere and Wilczek, (footnote 
9). It is astonishing, in retrospect, that such a fundamental result escaped notice for 60 years. 
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(obviously) is {\js n | V* )—this quantity is both real and imaginary, and it must there¬ 
fore be zero. Evidently the geometric phase vanishes whenever the eigenfunctions 
(of H(t)) are real.] 

2. Is Berry’s phase measurable? We are accustomed to thinking that the 
phase of the wave function is arbitrary—physical quantities involve |Vk| 2 , and the 
phase factor cancels out. But y n (T) can be measured, if (for example) we take a 
beam of particles (all in the state 'l') and split it in two, so that one beam passes 
through an adiabatically changing potential, while the other does not. When the two 
beams are recombined, the total wave function has the form 

^ = ^o + ^oe‘ T , [10.52] 

where 'ko is the “direct” beam wave function, and V is the extra phase (in part dynamic, 
and in part geometric) acquired by the beam subjected to the varying H ). In this case 

l*l 2 = I|'J'ol 2 (l + e' r )(l + <r ,r ) 

= il'I'ol 2 !! +C0SO = |%| 2 cos 2 (r/2). [10.53] 

So by looking for points of constructive and destructive interference (where F is an 
even or odd multiple of it, respectively), one can easily measure F. (Berry, and 
other early writers, worried that the geometric phase might be swamped by a larger 
dynamic phase, but it has proved possible to arrange things so as to separate out the 
two contributions.) 

3. Where does the derivation invoke the adiabatic hypothesis? At first 
glance, the argument going from Equation 10.40 to Equation 10.48 appears to have 
proved altogether too much. Why doesn’t the derivation work in reverse, showing that 
as long as y n ( t ) is given by Equation 10.48, the expression in Equation 10.40 satisfies 
the Schrodinger equation exactly — whether or not the process is adiabaticK l) (This 
would, of course, be nonsense; it would imply that the adiabatic theorem is empty: 
No transitions ever occur, even if the change in the Hamiltonian is far from gradual.) 
The answer is that the step following Equation 10.43, in which we take the inner 
product, cannot in general be reversed: Although Equation 10.43 implies Equa¬ 
tion 10.44, Equation 10.44 does not imply Equation 10.43. In fact, there is a serious 
fraud in the derivation, which I did not confess at the time because it somewhat spoils 
the beauty of the argument. The truth is that although Equation 10.44 is correct, Equa¬ 
tion 10.43 is not. 12 For Equation 10.40 is only true in the extreme adiabatic limit—the 


12 Indeed, if you take Equation 10.43 at face value, it can be solved directly for y n : 

= iWIn^„) => f n (x,t) = 4>„{x)e~ iyn{t \ 
at d t 

and hence (going back to Equation 10.40), 

* n (x,t) = 4 , n{ x)e- L *h E " (t ’ W . 

The geometric phase, in effect, soaks up the time dependence acquired by the eigenfunction i/r„ (x, t) as a 
consequence of the change in H. But this is completely false, as we shall see in the examples. 
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exact solution would contain admixtures of other states: 


V n (x, t ) = f n (x, c m (t)f m (x, t), [10.54] 

m^n 


where € = T t /T e characterizes the departure from adiabaticity (it goes to zero in the 
adiabatic limit). Inclusion of this term modifies Equation 10.43, to read 


3f n 

~JT 






Both terms on the left are first order in € (if the Hamiltonian didn’t change at all , 
both df n /dt and y n would be zero), but so are the first two terms on the right. The 
final term is second order, so it can legitimately be ignored, but dropping the first two 
(as I did, implicitly, in my derivation of Equation 10.43), is illegal. For consistency 
(noting, while I’m at it, that y n is already first order, so e iy " = 1 on the right), I should 
have written 


9l jf n 

+ 


dyn 

dt 


[10 - 56] 


instead of Equation 10.43. Fortunately, the inner product (with fi n ) kills the extra 
term, and that’s how it comes about that Equation 10.44 is correct, even though 
Equation 10.43, from which it was obtained, was not. (See Problem 10.7.) 

When the parameter space is three dimensional, R = (R u R 2 , R 2 ), Berry’s 
formula (Equation 10.49) is reminiscent of the expression for magnetic flux in terms 
of the vector potential A. The flux, d>, through a surface S bounded by a curve C 
(Figure 10.10), is 

<t> = J B da. [10.57] 

If we write the magnetic field in terms of the vector potential (B = V x A), and apply 
Stokes’ theorem: 

<I> = J (V x A) • da = £ A • dr. [10.58] 



Figure 10.10: Magnetic flux through a 
surface S bounded by the closed curve C. 
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Thus Berry’s phase can be thought of as the “flux” of a “magnetic field” 

“B” = iV R x <Vr n |V*^„), [10.59] 

through the (closed-loop) trajectory in parameter space. In the three-dimensional 
case, then, Berry’s phase can be written as a surface integral, 

y n (T) - i J [Vi? x • da. [10.60] 

The magnetic analogy can be carried much further, but for our purposes Equa¬ 
tion 10.60 is merely a convenient alternative expression for y n (T). 


❖Problem 10.4 

(a) Use Equation 10.46 to calculate the geometric phase change when the infinite 
square well expands adiabatically from width uq to width uq. Comment on this 
result. 

(b) If the expansion occurs at a constant rate {dw/dt = u), what is the dynamic 
phase change for this process? 

(c) If the well now contracts back to its original size, what is Berry’s phase for the 
cycle? 


Problem 10.5 The delta-function well (Equation 2.96) supports a single bound 
state (Equation 2.111). Calculate the geometric phase change when a gradually 
increases from cq to a?. If the increase occurs at a constant rate (da/dt = c), what 
is the dynamic phase change for this process? 

Problem 10.6 As I noted in the text (and Problems 10.4 and 10.5 confirm), if 
yj/ n (x , t) is real , the geometric phase vanishes. You might try to beat this rap by 
tacking an unnecessary (but perfectly legal) phase factor onto the eigenfunctions: 
^' n (x, t) = e l ^ n \jr n (x, t ), where 0„(R) is an arbitrary (real) function. Try it. You’ll 
get a nonzero geometric phase, all right, but note what happens when you put it back 
into Equation 10.40. And for a closed loop it gives zero. Moral. For nonzero Berry’s 
phase, you need (1) more than one time-dependent parameter in the Hamiltonian, and 
(2) a Hamiltonian that yields nontrivially complex eigenfunctions. 


10.2.3 An Example 

The classic example of Berry’s phase is an electron at the origin, subjected to a 
magnetic field of constant magnitude but changing direction. Consider first the special 
case (analyzed in Section 10.1.3) in which B(t) precesses around at a constant angular 
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velocity co, making a fixed angle a with the z-axis. The exact solution (for an electron 
that starts out with “spin up” along B) is given by Equation 10.33. In the adiabatic 
regime, co co\, 


X = cou 1 + 2— cos a + ( — ) = &>i ( 1 + — cos a: ) = co j + cocos a, [10.61] 

V co i \m) V co\ ) 

and Equation 10.33 becomes 

X (t) £ e i^l2 e i(^ a )t!2 e -ia>t!Z x+{t) 

+ i sin a sin j e~ iwt/2 X -(t). [10.62] 

As co/coi 0 the second term drops out completely, and the result matches the 
expected adiabatic form (Equation 10.40). The dynamic phase is 

6 + (t) = -1 jf E + (t')dt' = ^ [10.63] 

(where £+ = ~hoo\ /2 is taken from Equation 10.29), so the geometric phase is 

y+(0 = (cosof - l)y. [10.64] 

For a complete cycle T — Injoj, and therefore Berry’s phase is 

y + (T) — 7r(coscif — 1). [10.65] 

Now consider the more general case, in which the tip of the magnetic field 
vector sweeps out an arbitrary closed curve on the surface of a sphere of radius 
r = B 0 (Figure 10.11). The eigenstate representing spin up along B(f) has the form 
(see Problem 4.31) 


X+ 


/ cos (0/2) \ 

\ e i4> sin(0/2 )) ’ 


[ 10 . 66 ] 



Figure 10.11: Magnetic field of constant 
magnitude but changing direction sweeps 
out a closed loop. 
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where 0 and 0 (the spherical coordinates of B) are now both functions of time. 
Looking up the gradient in spherical coordinates, we find 


Hence 


dx+ „ 1 $x+ ~ 

vx + = -£ ± ? + —£re- 

Br r B6 


r sin# 90 


_ 1 / —(1/2) sin(0/2) \ * _±_ ( 0 N g 

- r \ (I /'2)^ cos(0/2> J runO [ie 1 * sin(0/2) J v ' 

(X+\V X+ ) = ^[-sin(e/2)cos(S/2)S + sin(9/2) cos(fl/2) 0 


[10.67] 


+ 2 i 


. sin 2 (6>/2) 


*]• 


.sin 2 (9/2) 


sin 0 J rsin^ 
For Equation 10.60 we need the curl of this quantity: 


[ 10 . 68 ] 


Vx 


1 9 f. (i sin 2 (6>/2) \ 


According to Equation 10.60, then, 

y + (0 = -i/2 


r • dsi. 


r = -~ 2 r. [10.69] 


[10.70] 


The integral is over the area on the sphere swept out by B in the course of the cycle, 
so d& — r 2 d£2r, and hence 

y+( T ) = -jf <i Q = -j a ’ [ 10 . 71 ] 

where Q is the solid angle subtended at the origin. This is a delightfully simple 
result, and tantalizingly reminiscent of the classical problem with which we began 
the discussion (transport of a frictionless pendulum around a closed path on the surface 
of the earth). It says that if you take a magnet, and lead the electron’s spin around 
adiabatically in an arbitrary closed path, the net (geometric) phase change will be 
minus one half the solid angle swept out by the magnetic field vector. In view of 
Equation 10.37, the general result is consistent with the special case Equation 10.65, 
as of course it had to be. 


Problem 10.7 Consider, once again, the special case of the precessing field (Sec¬ 
tion 10.1.3). 


(a) Use the eigenspinor (Equation 10.27) to determine (x+\(Bx+/Bt)), and put the 
result into Equation 10.44, for an alternative derivation of Equation 10.64. 
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(b) Show that Equation 10.43 does not work, in this case. Use Equation 10.62 to 
determine c_ (in Equation 10.54). Confirm that the last term in Equation 10.55 
is second order in co (don’t forget the € = co/coi out front). Show that y+(t) 
(Equation 10.64) does satisfy the corrected version of Equation 10.43, Equa¬ 
tion 10.56. 


***Problem 10.8 Work out the analog to Equation 10.71 for a particle of spin 1. 
Answer. — £2 (for spin 5 the result is —sQ,/ 2). 


10.2.4 The Aharonov-Bohm Effect 


In classical electrodynamics the potentials {<p and A) 13 are not directly measurable— 
the physical quantities are the electric and magnetic fields: 

9A 

E = —V<p ——, B = V x A. [10.72] 

at 

The fundamental laws of the theory (Maxwell’s equations and the Lorentz force law) 
make no reference to potentials, which are (from a logical point of view) no more 
than convenient but dispensible scaffolding for getting a better purchase on the real 
structure (the fields). Indeed, you’re perfectly free to change the potentials: 

<p <p' = <p ——, A —► A' — A -j- VA, [10.73] 


where A is an arbitrary function of position and time; this is called a gauge transfor¬ 
mation, and it has no effect at all on the fields. 

In quantum mechanics the potentials play a more significant role, for the Hamil¬ 
tonian (Equation 4.201) is expressed in terms of tp and A, not E and B: 


ff= i(7 v_,A ) +qv - 


[10.74] 


Nevertheless, the theory is still invariant under gauge transformations (see Problem 
4.53), and it was taken for granted until quite recently that there could be no elec¬ 
tromagnetic influences in regions where E and B are zero—any more than there can 
be in the classical theory. But in 1959 Aharonov and Bohm 14 showed that the vector 
potential can affect the quantum behavior of a charged particle that never encoun¬ 
ters an electromagnetic field. I’ll work out a simple example first, then discuss the 


13 I’m sorry, but we have reached a notational impasse: It is customary in quantum mechanics to use 
the letter V for potential energy, but in electrodynamics the same letter is reserved for the scalar potential. 
To avoid confusion I’ll use <p for the scalar potential. See Problems 4.51, 4.52, and 4.53 for background 
on this material. 

14 Y. Aharonov and D. Bohm, Phys. Rev. 115, 485 (1959). For a significant precursor, see W. 
Ehrenberg and R. E. Siday, Proc. Phys . Soc. London B62, 8 (1949). 
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Aharanov-Bohm effect itself, and finally indicate how it can be thought of as an 
example of Berry’s phase. 

Imagine a particle constrained to move in a circle of radius b (a bead on a 
wire ring, if you like). Along the axis runs a solenoid of radius a < b, carrying a 
magnetic field B (see Figure 10.12). If the solenoid is extremely long, the field inside 
is uniform, and the field outside is zero. But the vector potential outside the solenoid 
is not zero; in fact (adopting the convenient gauge condition V • A = 0), 15 

A = -^-0, (r > a), [10.75] 


where <t> = it a 2 B is the magnetic flux through the solenoid. Meanwhile, the solenoid 
is uncharged, so the scalar potential <p is zero. In this case the Hamiltonian (Equa¬ 
tion 10.74) becomes 


H=— \-h 2 V 2 + q 2 A 2 + 2ihqA ■ V] . [10.76] 

2m L 

But the wave function depends only on the azimuthal angle (f >, (0 = it/2 and r = b) 
so V ( 4>/b){d/d4 j), and the Schrodinger equation reads 


1 h 2 d 2 / q<& \ 2 bq® d 
2m b 2 d(j> 2 + \2itb) + l 7tb 2 d<p 


ir((f>) = E\ft((f)). 


[10.77] 



Figure 10.12: Charged bead on a circular ring through which a long solenoid passes. 
This is a linear differential equation with constant coefficients: 


d 2 f n df 

— -2/^-£-+€V r = 0, 
dtp 2 dtp 


[10.78] 


15 See, for instance, D. J. Griffiths, Introduction to Electrodynamics, 2nd ed. (Englewood Cliffs, 
NJ: Prentice Hall, 1989), Equation 5.65. 
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where 

q<t> 2mb 2 E n2 

[10.79] 

Solutions are of the form 

and € =-=- fr. 

2 7th h 2 

with 

f = Ae iX(p , 

[10.80] 

k = 

f ± sfp 2 +e =f± h -sFhmE. 

h 

[10.81] 


Continuity of at 0 = 2n, requires that A. be an integer. 


P± 


-s/2mE = n, 
h 


[10.82] 


and it follows that 


E t 



(n = 0, ±1, ±2, 


[10.83] 


The solenoid lifts the twofold degeneracy of the bead on a ring (Problem 2.43): 
Positive n, representing a particle traveling in the same direction as the current in 
the solenoid, has a somewhat lower energy (assuming q is positive) than negative n, 
describing a particle traveling in the opposite direction. And, more important, the 
allowed energies clearly depend on the field inside the solenoid, even though the field 
at the location of the particle is zero. 16 

More generally, suppose a particle is moving through a region where B is zero 
(so V x A = 0), but A itself is not. (I’ll assume that A is static, although the method 
can be generalized to time-dependent potentials.) The (time-dependent) Schrodinger 
equation, 


1 

2m 


jV-?a) 


+ V 


= ih 


9vp 


[10.84] 


with potential energy V —'which may or may not include an electrical contribution 
q<p —can be simplified by writing 


[10.85] 

where 

S(r) = !j^A(r')-dr', [10.86] 


16 It is a peculiar property of superconducting rings that the enclosed flux is quantized: <t> = 
(2 7rh/q)n r , where n’ is an integer. In this case the effect is undetectable, since E n = (h 2 /2mb 2 )(n + n') 2 , 
and (n + n’) is just another integer. (Incidentally, the charge q here turns out to be twice the charge of 
an electron; the superconducting electrons are locked together in pairs.) However, flux quantization is 
enforced by the superconductor (which induces circulating currents to make up the difference), not by the 
solenoid or the electromagnetic field, and it does not occur in the example considered here. 
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and O is some (arbitrarily chosen) reference point. Note that this definition makes 
sense only when V x A = 0 throughout the region in question—otherwise the line 
integral depends entirely on the path taken from O to r, and hence does not define a 
function of r. In terms of 'I'', the gradient of 4> is 

W = e i8 (iVg)V' + e ig (V4' / ); 


but Vg = (q/h) A, so 


and it follows that 


-e i8 W', 


(jV-«a)« = y 

^V-?A^ 4> = -h 2 e ig V 2 V. 


[10.87] 


[ 10 . 88 ] 


Putting this into Equation 10.84, and canceling the common factor of e ig , we are left 
with 

t, 2 a up 

_ _- v 2 4>' + VV' = ih—-. [10.89] 

2m dt 


Evidently 4*' satisfies the Schrodinger equation without A. If we can solve Equa¬ 
tion 10.89, correcting for the presence of a (curl-free) vector potential is trivial: You 
just tack on the phase factor e lg . 

Aharonov and Bohm proposed an experiment in which a beam of electrons is 
split in two, and passed either side of a long solenoid, before being recombined (Fig¬ 
ure 10.13). The beams are kept well away from the solenoid itself, so they encounter 
only regions where B = 0. But A, which is given by Equation 10.75, is not zero, and 
(assuming V is the same on both sides), the two beams arrive with different phases: 



Figure 10.13: The Aharonov-Bohm effect: electron beam splits, with half pass¬ 
ing either side of a long solenoid. 
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g = ll Xdr= ^I (;*)' (r ^> = ± t- [ia90] 

The plus sign applies to the electrons traveling in the same direction as A—which is 
to say, in the same direction as the current in the solenoid. The beams arrive out of 
phase by an amount proportional to the magnetic flux their paths encircle: 

a <f> 

phase difference = ™. [10.91] 

This phase shift leads to measurable interference (as in Equation 10.53), and has been 
confirmed experimentally by Chambers and others. 17 

The Aharonov-Bohm effect can be regarded as an example of geometric phase, 
as Berry himself noted in his first paper. Suppose the charged particle is confined to a 
box (which is centered at point R outside the solenoid) by a potential V (r — R)—see 
Figure 10.14. (In a moment we’re going to transport the box around the solenoid, 
so R will become a function of time, but for now it is just some fixed vector.) The 
eigenfunctions of the Hamiltonian are determined by 

4^*V-?A(r)j +F(r-R)J^ = £„^r„. [10.92] 

We have already learned how to solve equations of this form: 

ijr n = e ig ir' n , [10.93] 



Figure 10.14: Particle confined to a box, by a potential V(r- R). 


? R. G. Chambers, Phys. Rev. Lett. 5, 3 (1960). 
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where 18 r 

g= \ f A(r') ■ dr' 
h Jr 

and satisfies the same eigenvalue equation, only with A -> 0: 


2m 


V (r ■ 


-R)j i/’ n = E n f' n . 


[10.94] 


[10.95] 


Notice that ^ is a function only of the combination (r — R), not (like t/r„) of r and 
R separately. 

Now let’s carry the box around the solenoid (in this case the process doesn’t 
even have to be adiabatic). To determine Berry’s phase, we must first evaluate the 
quantity (f„\ V*t/r„). Noting that 

VtfV'n = v* [e i8 f' n (r - R)] = -i^A(R)e i8 f' n (r - R) + e ig V R f' n (r - R), 


we find 


= j e~ ig [^(r-R)]*e ig ^- 




(r - R)]V* | -/|A(R)^(r - R) + 


■i(r-R)] 


d s T 


= -i |A(R) - j [^(r - R)]*V^(r - R) d 3 r. [10, 


>.96] 


The V with no subscript denotes the gradient with respect to r, and I used the fact 
that V R — -V, when acting on a function of (r - R). But the last integral is 
i /h times the expectation value of momentum, in an eigenstate of the Hamiltonian 
-(h 2 /2m)V 2 + V, which we know from Section 2.1 is zero. So 

= -i|A(R). [10.97] 

Putting this into Berry’s formula (Equation 10.49), we conclude that 

Yn (T) = | ^A(R) * dR = | j(W x A) • d* = [10.98] 

which neatly confirms the Aharonov-Bohm result (Equation 10.91), and reveals that 
the Aharonov-Bohm effect is a particular instance of geometric phase. 19 


18 It is convenient to set the reference point O at the center of the box, for this guarantees that we 
recover the original phase convention for i/r„ when we complete the journey around the solenoid. If you use 
a fixed point in space, for example, you’ll have to readjust the phase “by hand”, at the far end; this leads to 
exactly the same answer, but it’s a crude way to do it. In general, when choosing the phase convention for 
the eigenfunctions in Equation 10.39, you want to make sure that i ff„ (x, T) = i fr„ (x, 0) so that no spurious 
phase changes are introduced. 

19 Incidentally, in this case the analogy between Berry’s phase and magnetic flux (Equation 10.59) 
becomes almost an identity: “B” = {q/Ti) B. 
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What are we to make of the Aharonov-Bohm effect? Evidently our classical 
preconceptions are simply mistaken. There can be electromagnetic effects in regions 
where the fields are zero. Note, however, that this does not make A itself measurable— 
only the enclosed flux comes into the final answer, and the theory remains gauge 
invariant. 

Problem 10.9 

(a) Derive Equation 10.76, from Equation 10.74. 

(b) Derive Equation 10.88, starting with Equation 10.87. 


FURTHER PROBLEMS FOR CHAPTER 10 _ 

* * ^Problem 10.10 Suppose the one-dimensional harmonic oscillator (mass m, fre¬ 
quency co) is subjected to a driving force of the form Fit) = moo 2 fit), where fit) is 
some specified function [I have factored out mco 2 for notational convenience; notice 
that f{t) has the dimensions of length ]. The Hamiltonian is 

H{t) = ~ + \mco 2 x 2 - moo 2 xf{t). [10.99] 

2m dx l 2 

Assume that the force was first turned on at time t = 0: fit) = 0 for t < 0. 
This system can be solved exactly, both in classical mechanics and in quantum 
mechanics. 20 


(a) Determine the classical position of the oscillator, assuming it started out at rest 
at the origin [x c (0) = x c (0) = 0]. Answer: 

x c {t) ~ a> f fit') sinMf - t')] dt'. [10.100] 

Jo 

(b) Show that the solution to the (time-dependent) Schrodinger equation for this 

oscillator, assuming it started out in the nth state of the undriven oscillator 
[vp(x, 0) = (x), where t^„(x) is given by Equation 2.50], can be written as 

vp(jc, t) = xjr n ix ~ x c )e k i~ (n+ll)hM+mic(x ~^ )+ ^ ^ /(Ox c (0^']_ [10 101] 
(C) Show that the eigenfunctions and eigenvalues of Hit) are 

fnixj) = fnix - f); E„it) = (n+ ^\hco --mco 2 f 2 . [10.102] 


20 See Y. Nogami, Am. J. Phys. 59 ,64 (1991) and references therein. 
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(d) Show that in the adiabatic approximation the classical position (Equation 10.100) 
reduces to x c (t) = f(t). Hint. Use the integration-by-parts trick of Section 
10.1.2. State the precise criterion—analogous to Equation 10.15—for adia- 
baticity. 

(e) Confirm the adiabatic theorem for this example, by using the results in (c) and 
(d) to show that 


VI I(x,t) = f n (x, t)e Wn{t) e iYn{t) . [10.103] 

Check that the dynamic phase has the correct form (Equation 10.41). Is the 
geometric phase what you would expect? 


***Problern 10.11 In time-dependent perturbation theory, we used the completeness 
of the unperturbed eigenfunctions (of H 0 ) to expand vp( x , t) (see Equation 9.81). 
But we could as well use the instantaneous eigenfunctions of H{t) (Equation 10.39), 
since they, too, are complete: 

*(*. 0 = ^2 c n(t)f n (x, t)e i9 \ [10.104] 

n 

where 0„(t) is given by Equation 10.41. We can use this expansion to develop an 
adiabatic series, whose leading term is the adiabatic approximation itself and whose 
successive terms represent the departure from perfect adiabaticity. 

(a) Insert Equation 10.104 into the (time-dependent) Schrodinger equation, and 
obtain the following formula for the coefficients: 

Cm=~ [10.105] 

(b) Suppose the system starts out in the jV th state; in the adiabatic approximation, it 
remains in the jV 1 * 1 state, picking up (at most) a time-dependent geometric phase 
(compare Equations 10.40 and 10.104): 


c n {t) = 8 nN e iyN(t) . [10.106] 

Substitute this into the right side of Equation 10.105, and obtain the “first cor¬ 
rection” to adiabaticity: 


c m (t) = c m (0) 




] dt’. 


[10.107] 


This enables us to calculate transition probabilities in the nearly adiabatic 
regime. To develop the “second correction,” we would insert Equation 10.107 
on the right side of Equation 10.105, and so on. 
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(C) As an example, apply Equation 10.107 to the driven-oscillator (Problem 10.10). 
Show that (in the near-adiabatic approximation) transitions are possible only to 
the two immediately adjacent levels, for which 


cn+ i (0 = i 




dt’. 


c N-i(t) = f Q /V) e ia *'dt'. 

(The transition probabilities are the absolute squares of these, of course.) 



CHAPTER 11 


SCATTERING 


11.1 INTRODUCTION 


11.1.1 Classical Scattering Theory 

Imagine a particle incident on some scattering center (say, a proton fired at a heavy 
nucleus). It comes in with an energy E and an impact parameter b, and it emerges at 
some scattering angle 0 —see Figure 11.1. (I’ll assume for simplicity that the target 
is azimuthally symmetrical, so the trajectory remains in one plane, and that the target 
is very heavy, so the recoil is negligible.) The essential problem of classical scattering 



Figure 11.1 The classical scattering 
problem, showing the impact parameter b 
and the scattering angle 0. 
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theory is this: Given the impact parameter, calculate the scattering angle. Ordinarily, 
of course, the smaller the impact parameter, the greater the scattering angle. 


Example: Hard-sphere scattering. Suppose the target is a billiard ball, of 
radius R, and the incident particle is a BB, which bounces off elastically (Figure 
11.2). In terms of the angle a , the impact parameter is b = R sin a, and the scattering 
angle is 0 = n — 2a, so 

*=*sm (!-!)=*«■(!). 

Evidently 

2cos ~\b/R), \ib < R, 

0, if b>R. 



[H.l] 

[H.2] 


More generally, particles incident within an infinitesimal patch of cross-sectional 
area da will scatter into a corresponding infinitesimal solid angle dQ (Figure 11.3). 
The larger da is, the bigger dQ. will be; the proportionality factor, D(0) = da/dQ, 
is called the differential (scattering) cross-section 1 : 



lr This is terrible language: D isn’t a differential —if anything, it’s a derivative —and it isn’t a 
cross-section. To my ear, the words “differential cross-section” would apply more properly to da. But I’m 
afraid we’re stuck with this terminology. I should also warn you that the notation D(0) is nonstandard: 
Most people just call it do/dQ., but I think it will be less confusing if we give the differential cross-section 
its own symbol. 
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In terms of the impact parameter and the azimuthal angle <p, da = b dbd<p and 
dQ = sin 9d0d<f>, so 


m = -A- ~ 

sin# do 


[11.4] 


(Since 0 is typically a decreasing function of b, the derivative is actually negative— 
hence the absolute value sign.) 


Example: Hard-sphere scattering (continued). 

scattering (Equation 11.1), 



In the case of hard-sphere 


[11.5] 


so 


R cos(#/2) / R sin(#/2) \ _ R 2 
sin# \ 2 ) ~ 4 


[ 11 . 6 ] 


This example is unusual in that the differential cross-section is actually independent 
of 0. 


The total cross-section is the integral of D(Q) over all solid angles: 


as S 


D(0)dQ\ 


[31.7] 


roughly speaking, it is the total area of incident beam that is scattered by the target. 
For example, in the case of the hard sphere, 


tr = (R 2 / 4) 


/ 


dQ = ttR 2 , 


[ 11 . 8 ] 
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which is just what we would expect: It’s the cross-sectional area of the sphere; BBs 
incident within this area will hit the target, and those farther out will miss it completely. 
But the virtue of the formalism developed here is that it applies just as well to “soft” 
targets (such as the Coulomb field of a nucleus) that are not simply “hit or miss.” 

Finally, suppose we have a beam of incident particles, with uniform intensity 
(or luminosity, as particle physicists call it): 

C = number of incident particles per unit area, per unit time. [11.9] 


The number of particles entering area da (and hence scattering into solid angle dQ), 
per unit time, is dN = Cda = CD{9) dS 2, so 




[ 11 . 10 ] 


This is often taken as the definition of the differential cross-section, because it makes 
reference only to quantities easily measured in the laboratory: If the detector accepts 
particles scattering into a solid angle dQ, we simply count the number recorded, per 
unit time, divide by dCl, and normalize to the luminosity of the incident beam. 


***Problem 11.1 Consider the problem of Rutherford scattering: An incident par¬ 
ticle of charge q x and kinetic energy E scatters off a heavy stationary particle of 
charge qi. 


(a) Derive the formula relating the impact parameter to the scattering angle. Note: 
This is not easy, and you might want to refer to a book on classical me¬ 
chanics, such as Jerry B. Marion, Classical Dynamics of Particles and Sys¬ 
tems, 2nd ed. (New York: Academic Press, (1970)), Section 9.5. Answer: 
b = {qxqil^ireoE) cot(0/2). 

(b) Determine the differential scattering cross-section. Answer: 

D ( 0 )=[- ~~2 - 1 • [ 1111 ] 

\_16ttcqE sin 2 (0/2) J 

(c) Show that the total cross-section for Rutherford scattering is infinite. 


11.1.2 Quantum Scattering Theory 

In the quantum theory of scattering, we imagine an incident plane wave, ijr(z) = 
Ae ikz , traveling in the z-direction, which encounters a scattering potential, producing 
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an outgoing spherical wave (Figure 11.4). 2 That is, we will look for solutions to the 
Schrodinger equation of the general form 


f e ikr 1 


t(r,e)*>AU k2 + m - , 

for large r. 

1 r 1 


(The spherical wave must carry a factor of 1 jr, because this portion of I 2 must go 
like 1/r 2 to conserve probability.) The wave number k is related to the energy of the 
incident particles in the usual way: 


k = 



[11.13] 


(As before, I shall assume the target is azimuthally symmetrical; in the more general 
case the amplitude / of the outgoing spherical wave could depend on <p as well as 6.) 

The whole problem is to determine the scattering amplitude /(0); it tells 
you the probability of scattering in a given direction 0 , and hence is related to the 
differential cross-section. Indeed, the probability that the incident particle, traveling 
at speed v, passes through the infinitesimal area da, in time dt, is (see Figure 11.5) 

dP = I ^incident 1 2 d V = \A\ 2 (vdt)do. 



Figure 11.4: Scattering of waves; incoming plane wave generates outgoing 
spherical wave. 


2 For the moment, there’s not much quantum mechanics in this; what we’re really talking about 
is the scattering of waves, as opposed to classical particles, and you could even think of Figure 11.4 
as a picture of water waves encountering a rock, or (better, since we’re interested in three-dimensional 
scattering) sound waves bouncing off a basketball. In that case we’d write the wave function in the real 
form 

/4{cos(kz)-f /(0)cos(Ar + S)/r], 

and /(#) would represent the amplitude of the scattered sound wave in the direction 6. 
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vdt 


Figure 11.5: The volume dV of incident beam that passes through area da in time dt. 

But this is equal to the probability that the particle later emerges into the corresponding 
solid angle dQ: 

dP = \t^\ 2 dv = 2?df {vdt ydn, 


from which it follows that da = \f\ 2 dQ , so 


D(0) = 


da 

dQ 


\m\ 2 . 


[11.14] 


Evidently the differential cross-section (which is the quantity of interest to the ex¬ 
perimentalist) is equal to the absolute square of the scattering amplitude (which is 
obtained by solving the Schrodinger equation). In the next sections we will study 
two techniques for calculating the scattering amplitude: partial wave analysis and 
the Bom approximation. 

Problem 11.2 Construct the analogs to Equation 11.12 for one-dimensional and 
two-dimensional scattering. 


11.2 PARTIAL WAVE ANALYSIS 


11.2.1 Formalism 


As we found in Chapter 4, the Schrodinger equation for a spherically symmetrical 
potential V (r) admits the separable solutions 

\Js(r, 6, (p) = R{r)Y™(9,<p), [11.15] 


where 7/” is a spherical harmonic (Equation 4.32) and u(r) = rR(r) satisfies the 
“radial equation” (Equation 4.37): 


h 2 d 2 u 
2m dr 2 


+ 


V(r) + 


n 2 /(/ + i) " 

2m r 2 


u = Eu. 


[11.16] 
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At very large r the potential goes to zero, and the centrifugal term is negligible, so 


d 2 u 
dr 2 


tt - -kru. 


The general solution is 

u(r) = Ce ikr + De~ ikr \ 

the first term represents an outgoing spherical wave, and the second an incoming 
one—for the scattered wave, we evidently want D = 0. At very large r, then, 



as we already deduced (on qualitative grounds) in the previous section (Equation 

11 . 12 ). 

That’s for very large r (more precisely, for kr » 1; in optics it would be called 
the radiation zone). As in one-dimensional scattering theory, we assume that the 
potential is localized, in the sense that exterior to some finite scattering region it is 
essentially zero (Figure 11.6). In the intermediate region (where V can be ignored 
but the centrifugal term cannot), 3 the radial equation becomes 


d 2 u 
dr^ 


1(1 + 1 ) 

— u 


-k 2 u. 


[11.17] 


and the general solution (as we found in Section 4.1.3) is a linear combination of 
spherical Bessel functions (Equation 4.45): 


u(r) = Arji(kr) + Brm(kr). [11.18] 

However, neither fi (which is something like a sine function) nor n, (which is a sort 
of generalized cosine function) represents an outgoing (or an incoming) wave. What 
we need are the linear combinations analogous to e ikr and e~ ikr ; these are known as 

spherical Hankel functions: 

^/ (1 V) - J)(x) + ini(x); hf\x) = j)(x) - infix). [11.19] 

The first few spherical Hankel functions are listed in Table 11.1. At large r, hf\kr) 
(the “Hankel function of the first kind”) goes like e ikr / r, whereas hf ] (kr) (the “Hankel 
function of the second kind”) goes like e~ lkr jr; for outgoing waves we evidently need 
spherical Hankel functions of the first kind: 

R(r) = Chf\kr). [11.20] 


2 What follows does not apply to the Coulomb potential, since 1 jr goes to zero more slowly than 
l/r , as r oo, and the centrifugal term does not dominate in this region. In this sense the Coulomb 
potential is not “localized.” 
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Figure 11.6: Scattering from a 
localized potential: the scattering region 
(shaded), the intermediate region (where 
V = 0), and the radiation zone 
(where fcr > 1). 


Thus the exact wave function, in the exterior region [where V (r) = 0], is 


\jr(r, 6,<f>) = A 


e ,b + J2 C i.XH^)Yr(e,<P) 


[ 11 . 21 ] 


Now, for very large r, the Hankel function goes like {—i) l+l e ,kr /kr (Table 11.1), so 


i Hr, A\ e lkz + f(0,4>) 


[ 11 . 22 ] 


where 


f(6,<p ) = i^(-o' + 'C/.„r,"(e,0). [H.23] 


This confirms more rigorously the general structure postulated in Equation 11.12, 
and tells us how to compute the scattering amplitude, f(0, <p), in terms of the partial 
wave amplitudes C Um . Evidently the differential cross-section is 

• ni.24] 

K l,m I'.m' 


Table 11.1: Spherical Hankel functions, h^'(x) and hf\x). 


h ? = (rji- \)* iz = 

h V = (-*-$ + L) e * = 

hf ] -► \ exp (+«' [x - §(/ + 1)]} | 
h\ 2) \ exp {-/ [x - § (/ + 1)]} I 


for* >> 1 




360 Chap. 11 Scattering 


and the total cross-section is 

a f = pE i^i 2 - t |1,25 J 

l,m V,m' ^ l,m 

(I used the orthonormality of the spherical harmonics, Equation 4.33, in the last step.) 

In the previous paragraph I kept the possible <p dependence because it cost me 
nothing. But if (as is ordinarily the case) V is independent of 4>, then only terms with 
m = 0 survive (remember, Yp ~ e im<i> ). Now (from Equations 4.27 and 4.32) 

if (0,0) = ^^Ylp,(cos0), [11.26] 

where Pj is the Ith Legendre polynomial. So for the case of azimuthal symmetry, the 
exact wave function (in the exterior region) is 


00 j2i JL 1 

\}/(r,9) = A e lkz + ^ V ~7— Cjhj^ (kr) Pi (cos 0) 

i—n ’ ^7T 


[11.27] 


the scattering amplitude is 


1 00 I II 4- 1 

/( 0 ) = 7 ~Z - c / p l(cos9); 

K /=o ’ 47r 

and the total cross-section is 


[11.28] 


<7 = 


I oo 

sEi c 'i 2 


k 2 


[11.29] 


11.2.2 Strategy 

All that remains is to determine the partial wave amplitudes C/ for the potential in 
question. This is accomplished by solving the Schrodinger equation in the interior 
region [where V (r) is distinctly nonzero] and matching this to the exterior solution 
(Equation 11.27), using the appropriate boundary conditions. But first I need to do 
a little cosmetic work, because as it stands my notation is hybrid: I used spherical 
coordinates for the scattered wave, but Cartesian coordinates for the incident wave. 
Before proceeding, it is useful to rewrite the wave function in a more consistent 
notation. 
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Of course, e lkz satisfies the Schrodinger equation with V = 0. On the other 
hand, I just argued that the general solution to the Schrodinger equation with V = 0 
can be written in the form 


Y.[A,, m Mkr) + B,„n,(k r )]Yr(e, 4,). 


In particular, then, it must be possible to express e lkz in this way. But e ,kz is finite 
at the origin, so no Neumann functions are allowed in the sum [nj(kr) blows up at 
r — 0], and since z = r cos 0 has no <j> dependence, only m = 0 terms occur. The 
expansion of a plane wave in terms of spherical waves is sometimes called Rayleigh’s 
formula 4 : 

00 

e ikz = J^V(2/+ l)j l (kr)Pi(cos0). [11.30] 

/=o 

Thus the wave function, in the exterior region, can be written in the more consistent 
form 


if(r,e) 


- a L 

1=0 


i l (2l + 1 )ji(kr) + yf^^-Qh^ikr) 


P/(cos0).[11.31] 


Example: Hard-sphere scattering. Suppose 
V(r) = 


The boundary condition, then, is 


oo, for r < a, 
0, for r > a. 


\{f (a, 0) = 0, 


[11.32] 


[11.33] 


7 (2/ + l)ji(ka) + y Cihf ] (ka) 

for all 0, from which it follows (Problem 11.3) that 


P/(cos6*) = 0 [11.34] 


[11.35] 


In particular, the total cross-section is 

4 Jt 
*2 


k 1=0 


ji(ka) 


hf\ka) 


[11.36] 


4 For a guide to the proof, see George Arfken, Mathematical Methods for Physicists, 3rd ed. 
(Orlando, FL: Academic Press, 1985), Exercise 12.4.7, page 665. 
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That’s the exact answer, but it’s not terribly illuminating, so let’s consider the 
limiting case of low-energy scattering: ka <£\. (Since k = 2 jt/X, this amounts to 
saying that the wavelength is much greater than the radius of the sphere.) Referring 
to Table 4.3, we note that nfz) is much larger than jfz), for small z, so 

Mz) = ji(z ) ^ . ji(z) 

h\ l) (z) Ji(z) + ini(z) 1 n/(z) 


^ , 2 l l\z l /(21 + 1)! _J_ T2!n_l 2 2/+I 

~{2l)\z~ l ~ l /2 l l\ 2/ + 1 j_ (21 )! J 

and hence 


[11.37] 



But we’re assuming ka <£ 1, so the higher powers are negligible—in the low-energy 
approximation the scattering is dominated by the / = 0 term. (This means that the 
differential cross-section is independent of 6, just as it was in the classical case.) 
Evidently 


a^4na 2 , [11.38] 

for low-energy hard-sphere scattering. Surprisingly, the scattering cross-section is 
four times the geometrical cross-section—in fact, a is the total surface area of the 
sphere . This “larger effective size” is characteristic of long-wavelength scattering (it 
would be true in optics, as well); in a sense, these waves “feel” their way around the 
whole sphere, whereas classical particles only see the head-on cross-section. 


Problem 11.3 Derive Equation 11.35, starting with Equation 11.34. 

**Problem 11.4 Consider the case of low-energy scattering from a spherical delta- 
function shell: 


V (r) = a8(r — a), 

where a and a are constants. Calculate the scattering amplitude /(6>), the differential 
cross-section D(0), and the total cross-section a. Assume ka <$; 1, so that only 
the l = 0 term contributes significantly. (To simplify matters, throw out all / ^ 
0 terms right from the start. The main problem, of course, is to determine C 0 .) 
Express your answer in terms of the dimensionless quantity 0 = 2 maa/h 2 . Answer: 
a = 4na 2 (j) 2 f{\ + 0) 2 . 
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11.3 THE BORN APPROXIMATION __ 

11.3.1 Integral Form of the Schrodinger Equation 

The time-independent Schrodinger equation, 

- T '-S/ 2 ir + Vir = Ef, [11.39] 

2m 

can be written more succinctly as 

(v 2 +i 2 )f = e. tn-4°] 

where _ 

t=— and Qs—rif. [11.41] 

n /r 

This has the superficial form of the Helmholtz equation; note, however, that the 
“inhomogeneous” term ( Q ) itself depends on f. Suppose we could find a function 
G(r) that solves the Helmholtz equation with a delta-function “source”: 

(V 2 + k 2 )G(r) = <5 3 (r). [11-42] 

Then we could express f as an integral: 

x/,(r) = J G(r-ro)e(ro)d 3 r 0 . [11.43] 

For it is easy to show that this satisfies Schrodinger’s equation, in the form of Equation 
11.40: 

(V 2 4- k 2 )\Js(r) = j [(V 2 4* k 2 )G(r — ro)] Q(ro) d 3 ro 
= f ~ r o)(?( r o) d 3 r 0 = Q( r). 

G(r) is called the Green’s function for the Helmholtz equation. (In general, the 
Green’s function for a given differential equation represents the “response” to a delta- 
function source.) 

Our first task 5 is to solve Equation 11.42 for G(r). This is most easily accom¬ 
plished by taking the Fourier transform, which turns the differential equation into an 
algebraic equation. Let 

GW = j e ‘ s r g(s)dh. [11.44] 

5 Warning: You are approaching two pages of heavy analysis, including contour integration; if you 
wish, skip straight to the answer. Equation 11.55. 
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Then 


(V 2 + A 2 )G(r ) = ~y^ j [(V 2 + *V sr ] g(s)rf 3 s. 


But 

and (see Equation 2.126) 


vV s ■ r = - 


i3(r) =^/ e ' srA ' 


[11.45] 

[11.46] 


so Equation 11.42 says 


(2tt)3/2 

It follows 6 that 


d 3 s 


g(s) = ■ 


(27T) 3 / 2 (A: 2 — s 2 ) ’ 
Putting this back into Equation 11.44, we find 


G(r) = ■ 


1 




1 


-d 3 s. 


[11.47] 


[11.48] 


(27r) 3 J (k 2 - s 2 ) 

Now r is fixed , as far as the s integration is concerned, so we may as well choose 
spherical coordinates (s, 0, </>) with the polar axis along r (Figure 11.7). Then s • r = 
sr cos 0, the </> integral is trivial (2 jt), and the 0 integral is 

- , ‘ srcos6 ’ l * 2sin(jr) 


r- 

7o 


sin0t/6> = -- 


[11.49] 


Thus 


. 12 ssin(sr) , 1 ssin(sr) , 

G(r)= a^rj 0 V-si dS= 4^~r J_ x F - *2 [11 - 50] 

The remaining integral is not so simple. It pays to revert to exponential notation 
and factor the denominator: 


G(r) = 


i h\C 


-ds 




(s - k)(s + fe)”“ J_oo (J — £)($ + k) 

= I^7 (/l_/2) - 




These two integrals can be evaluated using Cauchy’s integral formula: 

m 


(z - zo) 


dz = 2jiif(zo), 


[11.51] 


[11.52] 


6 This is clearly sufficient, but it is also necessary, as you can easily show by combining the two 
terms into a single integral and using Plancherel’s theorem, Equation 2.85. 



Sec. 11.3: The Born Approximation 365 



Figure 11.7: Convenient coordinates for 
the integral in equation [11.48]. 


if zo lies within the contour (otherwise the integral is zero). In the present case the 
integration is along the real axis, and it passes right over the pole singularities at 
±k. We have to decide how to skirt the poles—I’ll go over the one at —k and under 
the one at +k (Figure 11.8). (You’re welcome to choose some other convention if 
you like—even winding seven times around each pole; you’ll get a different Green’s 
function, but, as I’ll show you in a minute, they’re all equally acceptable.) 

For each integral in Equation 11.51 I must “close the contour” in such a way 
that the semicircle at infinity contributes nothing. In the case of I\, the factor e lsr 
goes to zero when s has a large positive imaginary part; for this one I close above 
(Figure 11.9a). The contour encloses only the singularity at s = +&, so 


h 



—ds — 2iti 
s — k 



= irte 


ikr 


[11.53] 


In the case of I 2 , the factor e~ lsr goes to zero when s has a large negative imaginary 
part, so we close below (Figure 11.9b); this time the contour encloses the singularity 
at s = —k (and it goes around in the clockwise direction, so we pick up a minus sign): 


h 



—— r ds = —2iti 
s + k 



i=-k 


-ine ikr . [11.54] 



Figure 11.8: Skirting the poles in the contour integral (Equation 11.51). 
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jOl 




(a) (b) 

Figure 11.9: Closing the contour in equations [11.53] and [11.54], 


Conclusion : 

GW = [11.55] 

This, finally, is the Green’s function for the Helmholtz equation—the solution 
to Equation 11.42. (If you got lost in all that analysis, you might want to check the 
result by direct differentiation—see Problem 11.5.) Or rather, it is a Green’s function 
for the Helmholtz equation, for we can add to G(r) any function Go(r) that satisfies 
the homogeneous Helmholtz equation: 


(V 2 4- k 2 )Go(r) = 0; [11.56] 

clearly, the result (G + Go) still satisfies Equation 11.42. This ambiguity corresponds 
precisely to the ambiguity in how to skirt the poles—a different choice amounts to 
picking a different function Go(r). 

Returning to Equation 11.43, the general solution to the Schrodinger equation 
takes the form 


V f (r) = V r o(r)-— j :-- V(r 0 )f(r Q ) </ 3 r 0 , 

2 nh z J |r — r 0 ] 


[11.57] 


where fo satisfies the free particle Schrodinger equation, 

(V 2 + k 2 )i/ 0 = 0. [11.58] 

Equation 11.57 is the integral form of the Schrodinger equation; it is entirely 
equivalent to the more familiar differential form. At first glance it looks like an ex¬ 
plicit solution to the Schrodinger equation (for any potential)—which is too good to 
be true. Don’t be deceived: There’s a if/ under the integral sign on the right-hand 
side, so we can’t do the integral unless we already know the solution! Nevertheless, 
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the integral form can be very powerful, and it is particularly well suited to scattering 
problems, as we’ll see in the following section. 

Problem 11.5 Check that Equation 11.55 satisfies Equation 11.42, by direct sub¬ 
stitution. Hint: V 2 (l fr) = — 4^5 3 (r). 7 

**Problem 11.6 Show that the ground state of hydrogen (Equation 4.80) satisfies 
the integral form of the Schrodinger equation, for the appropriate V and E (note that 
E is negative, so k = iic, where k = y/—2mE/ti). 


11.3.2 The First Bom Approximation 

Suppose V (ro) is localized about ro = 0—that is, the potential drops to zero outside 
some finite region (as is typical for a scattering problem), and we want to calculate 
i/r(r) at points far away from the scattering center. Then |r| |ro| for all points that 
contribute to the integral in Equation 11.57, so 



|r - r 0 | 2 = r 2 + r 2 - 2r- r 0 = r 2 (\ - , 

[11.59] 

and hence 

|r — r 0 | = r-r • r 0 . 

[11.60] 

Let 

k = kr- 

[11.61] 

then 

e ik\r — ro I ~ Q ikr e ~ik ■ r 0 

[11.62] 

and therefore 

„ik\r — r 0 | ikr 



~ e 

[11.63] 


1 r — r 0 1 r 



[In the denominator we can afford to make the more radical approximation 
|r — ro | = r; in the exponent we need to keep the next term. If this puzzles you, try 
writing out the next term in the expansion of the denominator. What we are doing 
is expanding in powers of the small quantity (ro/r) and dropping all but the lowest 
order.] 

In the case of scattering, we want 

fo(r ) = Ae ih . [11.64] 

representing an incident plane wave. For large r, then, 

m e ikr f 

f(r) = Ae ikz - — 3 — / e ~ ik ' r ° V (r 0 )^(r 0 ) d 3 r 0 . [11.65] 

2nh l r J 


7 See, for example, D. J. Griffiths, Introduction to Electrodynamics, 2nded. (Englewood Cliffs, NJ: 
Prentice Hall, 1989), p. 52. 
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This is in the standard form (Equation 11.12), and we can read off the scattering 
amplitude: 

/(M) = -^ 2 ^ J «" ik r °V(ro)Vf(ro)d 3 r 0 . [11.66] 

So far, this is exact. Now we invoke the Born approximation: Suppose the 
incoming plane wave is not substantially altered by the potential, then it makes sense 
to use 

VKro) ^o(r 0 ) = Ae ikz ° = Ae ik ' r °, [11.67] 

where 

k' = kz, [11.68] 

inside the integral. (This would be the exact wave function, if V were zero; it is 
essentially a weak potential approximation.) In the Bom approximation, then, 

/«>>« = -- TtP [1169] 

(In case you have lost track of the definitions of k and k', they both have magnitude 
k, but the former points in the direction of the incident beam, while the latter points 
toward the detector—see Figure 11.10.) 

In particular, for low-energy (long-wavelength) scattering, the exponential 
factor is essentially constant over the scattering region, and the Bom approximation 
simplifies to 


/ (9, <P) = ~ ^~2 J V (r) d 3 r, (low energy). [11.70] 


(I dropped the subscript on r, since there is no occasion for confusion at this point.) 



Figure 11.10: Two wave vectors in the 
Born approximation: k points in the incident 
direction, k' in the scattered direction. 
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Example: Low-energy soft-sphere 8 scattering. Suppose 
f Vo, if r < a. 


K(r) =l0, ifrx. 

In this case the low-energy scattering amplitude is 


’(H 

(independent of 6 and <p), the differential cross-section is 




do 
dQ : 


Mm- 


and the total cross-section is 


•nm- 


[11.71] 


[11.72] 


[11.73] 


[11.74] 


For a spherically symmetrical potential, V{r) — V(r), (but not necessarily 
at low energy), the Bom approximation again reduces to a simpler form. Define 

* = k'-k, [11.75] 

and let the polar axis for the ro integral lie along k, so that 

(k' - k) ■ r 0 = icro cos % [11.76] 

Then 

fW) ~ / eiKr " cos9o V(ro)rlsmdodrodeod^o. [11.77] 

The (fio integral is trivial (2 tz), and the Bq integral is one we have encountered before 
(see Equation 11.49). Dropping the subscript on r, we are left with 


m 


^ 2m f 

ti 2 K Jo 


rV(r) sin(/cr) dr, (spherical symmetry). 


[11.78] 


8 You can’t apply the Born approximation to hard -sphere scattering (Vo — oo)—the integral blows 
up. The point is that we assumed the potential is weak, and doesn’t change the wave function much in the 
scattering region. But a hard sphere changes it radically —from Ae ikz to zero. 
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The angular dependence of / is carried by k; from Figure 11.10 we see that 

k = 2&sin(0/2). [11.79] 


Example: Yukawa scattering. The Yukawa potential (which is a crude 
model for the binding force in an atomic nucleus) has the form 


V(r)=P -, 

r 

where P and fi are constants. The Bom approximation gives 


e^'sm (Kr)dr 

h K Jo 


Imp 


Jo h 2 (ii 2 + k 2 ) ' 

(You get to work out the integral for yourself in Problem 11.8.) 


[11.80] 


[11.81] 


Example: Rutherford scattering. If we put in p = qiq 2 f4nco, m = 0, 
the Yukawa potential reduces to the Coulomb potential, describing the electrical 
interaction of two point charges. Evidently the scattering amplitude is 


m = - 

or (using Equations 11.79 and 11.41), 

m = -t 


2mqiq 2 

Alt€oh 2 K 2 ’ 




\6jiCoE sin 2 (6/2) 
The differential cross-section is the square of this: 


da _ [~ qiqi ~| 2 

dQ L 167T eo^sin 2 (0/2) J 


[11.82] 


[11.83] 


[11.84] 


which is precisely the Rutherford formula (Equation 11.11). It happens that for the 
Coulomb potential, classical mechanics, the Bom approximation, and quantum field 
theory all yield the same result. In computer parlance, the Rutherford formula is 
amazingly “robust.” 

^Problem 11.7 Find the scattering amplitude, in the Born approximation, for soft- 
sphere scattering at arbitrary energy. Show that your formula reduces to Equation 
11.72 in the low-energy limit. 


Problem 11.8 Evaluate the integral in Equation 11.81 to confirm the expression 
on the right. 

** Problem 11.9 Calculate the total cross-section for scattering from a Yukawa po¬ 
tential in the Born approximation. Express your answer as a function of E. 
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❖Problem 11.10 For the potential in Problem 11.4, 

(a) calculate /( 0 ), D(9), and cr, in the low-energy Bom approximation; 

(b) calculate f (6) for arbitrary energies, in the Bom approximation; 

(c) show that your results are consistent with the answer to Problem 11.4, in the 
appropriate regime. 


11.3.3 The Bom Series 

The Bom approximation is similar in spirit to the impulse approximation in classical 
scattering theory. In the impulse approximation we begin by pretending that the 
particle keeps going in a straight line (Figure 11.11), and compute the transverse 
impulse that would be delivered to it in that case: 

I = j F± dt. [11.85] 

If the deflection is relatively small, this should be a good approximation to the trans¬ 
verse momentum imparted to the particle, and hence the scattering angle is 

0 =X 2 Ln~\l/p), [11.86] 

where p is the incident momentum. This is, if you like, the “first-order” impulse 
approximation (the zeroth -order is what we started with: no deflection at all). Like¬ 
wise, in the zeroth-order Bom approximation the incident plane wave passes by with 
no modification, and what we explored in the previous section is really the first-order 
correction to this. But the same idea can be iterated to generate a series of higher-order 
corrections, which presumably converge to the exact answer. 


Actual 



Scattering center 


Figure 11.11: The impulse approximation assumes that the particle continues 
undeflected, and calculates the transverse momentum delivered. 
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Figure 11.12: Diagrammatic interpretation of the Born series, Equation 11.91. 

The integral form of the Schrodinger equation reads 

^(r) = ^o(r) + J g(r - r 0 )V(r 0 )r/r(r 0 ) d 3 r 0 , [11.87] 

where is the incident wave. 


g( r ) - -^“72- 

27th 1 r 


[ 11 . 88 ] 


is the Green’s function (into which I have now incorporated the factor 2 mjh 2 , for 
convenience), and V is the scattering potential. Schematically, 


if = fo + j gVf. [11.89] 

Suppose we take this expression for -fi, and plug it in under the integral sign: 

^ = 1^0 + j gVfio + j gVgVf. [11.90] 


Iterating this procedure, we obtain a formal series for \fn 


t = to +jgVir 0 +JgVgVfi 0 +JgVgVgVir 0 + • ■ • + JigVTifo + •■■ .[11.91] 


In each term only the incident wave function (i^o) appears, together with more and 
more powers of gV . The first Bom approximation truncates the series after the second 
term, but it is clear now how one generates the higher-order corrections. 

The Bom series can be represented diagrammatically as shown in Figure 11.12. 
In zeroth order i ft is untouched by the potential; in first order it is “kicked” once, and 
then “propagates” out in some new direction; in second order it is kicked, propagates 
to a new location, is kicked again, and then propagates out; and so on. In this 
context the Green’s function is sometimes called the propagator —it tells you how 
the disturbance propagates between one interaction and the next. The Bom series was 
the inspiration for Feynman’s formulation of relativistic quantum mechanics, which 
is expressed entirely in terms of vertex factors (V) and propagators (g), connected 
together in Feynman diagrams. 
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Problem 11.11 Calculate 0 (as a function of the impact parameter) for Rutherford 
scattering, in the impulse approximation. Show that your result is consistent with the 
exact expression (Problem 11.1a) in the appropriate limit. 

***Problem 11.12 Find the scattering amplitude for low-energy soft-sphere scattering 
in the second Bom approximation. Answer: —(2mVoa 3 /3h 2 )[\ — (4mVoa 2 /57i 2 )]. 


FURTHER PROBLEMS FOR CHAPTER 11 


***Problem 11.13 Find the Green’s function for the one-dimensional Schrodinger 
equation, and use it to construct the integral form (analogous to Equation 11.57). 
Answer: ^ 

-JJT ( e ii,x ~ xal V(xo)i/r(xo)dxo. [11.92] 

n^k J-oo 


**Problem 11.14 Use your result in Problem 11.13 to develop the Bom approxi¬ 
mation for one-dimensional scattering. That is, choose — Ae' kx , and assume 
i/f(xo) = iAo(-ro) to evaluate the integral. Show that the reflection coefficient takes 
the form 


R = 



2 

? 2ikx V(x)dx 


[11.93] 


Problem 11.15 Use the one-dimensional Bom approximation (Problem 11.14) to 
compute the transmission coefficient (T = 1 — R) for scattering from a delta function 
(Equation 2.96) and from a finite square well (Equation 2.127). Compare your results 
with the exact answers (Equations 2.123 and 2.151). 



AFTERWORD 


Now that you have (I hope) a sound understanding of what quantum mechanics says, 
I should like to return to the question of what it means —continuing the story begun 
in Section 1.2. The source of the problem is the indeterminacy associated with the 
statistical interpretation of the wave function. For (or, more generally, the quantum 
state — it could be a spinor, for example) does not uniquely determine the outcome 
of a measurement; all it provides is the statistical distribution of all possible results. 
This raises a profound question: Did the physical system “actually have” the attribute 
in question prior to the measurement (the so-called realist viewpoint), or did the act 
of measurement itself “create” the property, limited only by the statistical constraint 
imposed by the wave function (the orthodox position)—or can we duck the question 
entirely, on the grounds that it is “metaphysical” (the agnostic response)? 

According to the realist, quantum mechanics is an incomplete theory, for even 
if you know everything quantum mechanics has to tell you about the system (to wit, 
its wave function), you still cannot determine all of its features. Evidently there is 
some other information, external to quantum mechanics, which (together with *k) is 
required for a complete description of physical reality. 

The orthodox position raises even more disturbing problems, for if the act of 
measurement forces the system to “take a stand,” helping to create an attribute that was 
not there previously, 1 then there is something very peculiar about the measurement 
process. Moreover, to account for the fact that an immediately repeated measurement 
yields the same result, we are forced to assume that the act of measurement collapses 


'This may be strange, but it is not mystical, as some popularizers would like to suggest. The 
so-called wave-particle duality, which Niels Bohr elevated into a cosmic principle (complementarity), 
makes electrons sound like unpredictable adolescents, who sometimes behave like adults, and sometimes, 
for no particular reason, like children. I prefer to avoid such language. When I say that a particle does not 
have a particular attribute until a measurement intervenes, I have in mind, for example, an electron in the 
spin state x = (o ); a measurement of the x-component of its angular momentum could return the value 
h/2, or (with equal probability) the value —hj 2, but until the measurement is made it simply does not have 
a well-defined value of S x . 
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the wave function, in a manner that is difficult, at best, to reconcile with the normal 
evolution prescribed by the Schrodinger equation. 

In light of this, it is no wonder that generations of physicists retreated to the 
agnostic position, and advised their students not to waste time worrying about the 
conceptual foundations of the theory. 


A.1 The EPR Paradox _ 

In 1935, Einstein, Podolsky, and Rosen 2 published the famous EPR paradox, which 
was designed to prove (on purely theoretical grounds) that the realist position is the 
only sustainable one. I’ll describe a simplified version of the EPR paradox, due to 
David Bohm. Consider the decay of the neutral pi meson into an electron and a 
positron: 

7T° e~ + e + . 

Assuming the pion was at rest, the electron and positron fly off in opposite directions 
(Figure A.l). Now, the pion has spin zero, so conservation of angular momentum 
requires that the electron and positron are in the singlet configuration: 

-^(f_| + - 4,_f + ). [A.l] 

If the electron is found to have spin up, the positron must have spin down, and 
vice versa. Quantum mechanics can’t tell you which combination you’ll get, in 
any particular pion decay, but it does say that the measurements will be correlated, 
and you’ll get each combination half the time (on average). Now suppose we let 
the electron and positron fly way off—10 meters, in a practical experiment, or, in 
principle, 10 light years—and then you measure the spin of the electron. Say you get 
spin up. Immediately you know that someone 20 meters (or 20 light years) away will 
get spin down, if that person examines the positron. 

To the realist, there’s nothing surprising in this—the electron really had spin 
up (and the positron spin down) from the moment they were created—it’s just that 
quantum mechanics didn’t know about it. But the “orthodox” view holds that neither 
particle had either spin up or spin down until the act of measurement intervened: 
Your measurement of the electron collapsed the wave function, and instantaneously 
“produced” the spin of the positron 20 meters (or 20 light years) away. Einstein, 
Podolsky, and Rosen considered any such “spooky action-at-a-distance” (Einstein’s 
words) preposterous. They concluded that the orthodox position is untenable; the 

Figure A.1: Bohm’s version of the EPR 
n0 + experiment: 7t° at rest decays into 

% -•_ Z- electron-positron pair. 


2 A. Einstein, B. Podolsky, and N. Rosen, Phys. Rev. 47, 111 (1935). 
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electron and positron must have had well-defined spins all along, whether quantum 
mechanics can calculate them or not. 

The fundamental assumption on which the EPR argument rests is that no in¬ 
fluence can propagate faster than the speed of light. We call this the principle of 
locality. You might be tempted to propose that the collapse of the wave function is 
not instantaneous, but somehow “travels” out at some finite velocity. However, this 
would lead to violations of angular momentum conservation, for if we measured the 
spin of the positron before the news of the collapse had reached it, there would be a 
50-50 probability of finding both particles with spin up. Whatever one might think 
of such a theory in the abstract, the experiments are unambiguous: No such violation 
occurs—the correlation of the spins is perfect. 


A.2 Bell’s Theorem __ 

Einstein, Podolsky, and Rosen did not doubt that quantum mechanics is correct, as far 
as it goes; they only claimed that it is an incomplete discretion of physical reality: The 
wave function is not the whole story—some other quantity, k, is needed, in addition to 
q/, to characterize the state of a system fully. We call k the “hidden variable” because, 
at this stage, we have no idea how to calculate or measure it. 3 Over the years, a number 
of hidden variable theories have been proposed, to supplement quantum mechanics; 
they tend to be cumbersome and implausible, but never mind—until 1964 the program 
seemed eminently worth pursuing. But in that year J. S. Bell proved that any local 
hidden variable theory is incompatible with quantum mechanics. 4 

Bell suggested a generalization of the EPR/Bohm experiment: Instead of ori¬ 
enting the electron and positron detectors along the same direction, he allowed them 
to be rotated independently. The first measures the component of the electron spin 
in the direction of a unit vector a, and the second measures the spin of the positron 
along the direction b (Figure A.2). For simplicity, let’s record the spins in units of 
h /2; then each detector registers the value +1 (for spin up) or — 1 (spin down), along 
the direction in question. A table of results, for many it 0 decays, might look like this: 


electron 

positron 

product 

+1 

-1 

-1 

+ 1 

+ 1 

+1 

-1 

+1 

-1 

+ 1 

-1 

-1 

-1 

-1 

+ 1 


3 The hidden variable could be a single number, or it could be a whole collection of numbers; 
perhaps A. is to be calculated in some future theory, or maybe it is for some reason of principle incalculable. 
It hardly matters. All I am asserting is that there must be something —if only a list of the outcomes of 
every possible experiment—associated with the system prior to a measurement. 

4 Bell’s original paper [Physics 1, 195 (1964)] is a gem: brief, accessible, and beautifully written. 
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Figure A.2: Bell’s version of the EPR-Bohm experiment: detectors independently 
oriented in directions a and b. 

Bell proposed to calculate the average value of the product of the spins, for a given 
set of detector orientations. Call this average P( a, b). If the detectors are parallel 
(b = a), we recover the original EPRB configuration; in this case one is spin up and 
the other spin down, so the product is always —1, and hence so too is the average: 

P( a,a) = -l. [A.2] 

By the same token, if they are anh'-parallel (b = —a), then every product is +1, so 

P( a, -a) = +1. [A.3] 


For arbitrary orientations, quantum mechanics predicts 


P( a, b) = -a b 


[A.4] 


(see Problem 4.44). What Bell discovered is that this result is impossible in any local 
hidden variable theory. 

The argument is stunningly simple. Suppose that the “complete” state of the 
electron/positron system is characterized by the hidden variable(s) A; A varies, in 
some way that we neither understand nor control, from one pion decay to the next. 
Suppose further that the outcome of the electron measurement is independent of 
the orientation (b) of the positron detector—which may, after all, be chosen by the 
experimenter at the positron end just before the electron measurement is made, and 
hence far too late for any subluminal message to get back to the electron detector. 
(This is the locality assumption.) Then there exists some function A( a, A) which 
gives the result of an electron measurement, and some other function B(b, A) for the 
positron measurement. These functions can only 5 take on the values =t 1: 

^4(a, A) = ±1; fi(b,A) = ±l. [A.5] 


5 This already concedes far more than a classical determinist would be prepared to allow, for it aban¬ 
dons any notion that the particles could have well-defined angular momentum vectors with simultaneously 
determinate components. But never mind—the point of Bell’s argument is to demonstrate that quantum 
mechanics is incompatible with any local deterministic theory—even one that bends over backward to be 
accommodating. 
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When the detectors are aligned, the results are perfectly (antiCorrelated: 

A(a, X) = —B(a, X), [A.6] 


for all X. 

Now, the average of the product of the measurements is 


P(a, b) = J p(X)A(a, X)B(b,X)dX, 


[A.7] 


where p (X) is the probability density for the hidden variable. [Like any probability 
density, it is nonnegative, and satisfies the normalization condition / p(X) dX — 1, 
but beyond this we make no assumptions about p (X ); different hidden variable the¬ 
ories would presumably deliver quite different expressions for p.] In view of Equa¬ 
tion A.6, we can eliminate B: 


P(a,b) = - j p(X)A(a,X)A(b,X)dX. [A.8] 

If c is any other unit vector, 

P(a,b)-P{a,c) = - j p{X)[A(a,X)A{b,X) - A{a,X)A{c,X)]dX. [A.9] 
Or, since U(b, X)] 2 = 1: 

P(a, b) - P(a, c) = - J p(X)[l - A{b,X)A{c,X)]A(a,X)A{b,X)dX. [A.10] 

But it follows from Equation A.5 that -1 < [^4(a, A)J(b, A.)] < +1, and 
p{X)[\ — A(b, X)A( c, X)]>0, so 

| J P(a,b)-P(a,c)| < J p{X)[l-A(b,X)A(c,X)]dX, [A.ll] 
or, more simply, 


P( a, b) - P( a, c) < 1 -j- P(b, c). 


[A. 12] 


This is the famous Bell inequality. It holds for any local hidden variable theory 
(subject only to the minimal requirements of Equations A.5 and A.6), for we have 
made no assumptions whatever as to the nature or number of the hidden variables or 
their distribution (p). 

But it is easy to show that the quantum mechanical prediction (Equation A.4) 
is incompatible with Bell’s inequality. For example, suppose all three vectors lie in 
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Figure A.3: An orientation of the 
detectors that demonstrates quantum 
violations of Bell’s inequality. 


a plane, and c makes a 45° angle with a and b (Figure A.3); in this case quantum 
mechanics says 

P(a, b) = 0, P( a, c) = P(b, c) = -0.707, 
which is patently inconsistent with Bell’s inequality: 

0.707 ^ 1 - 0.707 = 0.293. 

With Bell’s modification, then, the EPR paradox proves something far more 
radical than its authors imagined: If they are right, then not only is quantum mechanics 
incomplete , it is downright wrong. On the other hand, if quantum mechanics is right, 
then no hidden variable theory is going to rescue us from the nonlocality Einstein 
considered so preposterous. Moreover, we are provided with a very simple experiment 
to settle the issue once and for all. 

Many experiments to test Bell’s inequality were performed in the 1960s and 
1970’s, culminating in the work of Aspect, Grangier, and Roger. 6 The details do not 
concern us here (they actually used two-photon atomic transitions, not pion decays). 
To exclude the remote possibility that the positron detector might somehow “sense” 
the orientation of the electron detector, both orientations were set quasi-randomly after 
the photons were already in flight. The results were in excellent agreement with the 
predictions of quantum mechanics and clearly incompatible with Bell’s inequality. 7 

Ironically, the experimental confirmation of quantum mechanics came as some¬ 
thing of a shock to the scientific community. But not because it spelled the demise of 
“realism”—most physicists had long since adjusted to this (and for those who could 


6 A. Aspect, P. Grangier, and G, Roger, Phys. Rev. Lett. 49, 91 (1982), 

7 Bell’s theorem involves averages, and it is conceivable that an apparatus such as Aspect’s contains 
some secret bias which selects out a nonrepresentative sample, thus distorting the average. Recently, an 
improved version of Bell’s theorem has been proposed in which a single measurement suffices to distinguish 
between the quantum prediction and that of any local hidden variable theory. See D. Greenberger, M. Home, 
A. Shimony, and A. Zeilinger, Am. J. Phys. 58, 1131, (1990) and N, David Mermin, Am. J. Phys. 58, 731, 
(1990). 
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Figure A.4: The shadow of the bug moves across the screen at a velocity v' 
greater than c, provided that the screen is far enough away. 


not, there remained the possibility of nonlocal hidden variable theories, to which 
Bell’s theorem does not apply 8 ). The real shock was the proof that nature itself is 
fundamentally nonlocal. Nonlocality, in the form of the instantaneous collapse of the 
wave function (and for that matter also in the symmetrization requirement for iden¬ 
tical particles) had always been a feature of the orthodox interpretation, but before 
Aspect’s experiment it was possible to hope that quantum nonlocality was some¬ 
how a nonphysical artifact of the formalism, with no detectable consequences. That 
hope can no longer be sustained, and we are obliged to reexamine our objection to 
instantaneous action at a distance. 

Why are physicists so alarmed at the idea of superluminal influences? After all, 
there are many things that travel faster than light. If a bug flies across the beam of a 
movie projector, the speed of its shadow is proportional to the distance to the screen; 
in principle, that distance can be as large as you like, and hence the shadow can move 
at arbitrarily high velocity (Figure A.4). However, the shadow does not carry any 
energy, nor can it transmit any message from one point to another on the screen. A 
person at point X cannot cause anything to happen at point Y by manipulating the 
passing shadow. 

On the other hand, a causal influence that propagated faster than light would 
carry unacceptable implications. For according to special relativity there exist inertial 
frames in which such a signal propagates backward in time —the effect preceding the 
cause—and this leads to inescapable logical anomalies. (You could, for example, 
arrange to kill your infant grandfather.) The question is, are the superluminal influ¬ 
ences predicted by quantum mechanics and detected by Aspect causal, in this sense, 


8 It is a curious twist of fate that the EPR paradox, which assumed locality to prove realism, led 
finally to the repudiation of locality and left the issue of realism undecided—the outcome (as Mermin put 
it) Einstein would have liked least. Most physicists today consider that if they can’t have local realism, 
there’s not much point in realism at all, and for this reason nonlocal hidden variable theories occupy a rather 
peripheral place. Still, some authors—notably Bell himself, in Speakable and Unspeakable in Quantum 
Mechanics (Cambridge University Press, Cambridge, 1987)—argue that such theories offer the best hope 
of bridging the gap between the measured system and the measuring apparatus, and for supplying an 
intelligible mechanism for the collapse of the wave function. 
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or are they somehow ethereal enough (like the motion of the shadow) to escape the 
philosophical objection? 

Well, let’s consider Bell’s experiment. Does the measurement of the electron 
influence the outcome of the positron measurement? Assuredly it does —otherwise 
we cannot account for the correlation of the data. But does the measurement of the 
electron cause a particular outcome for the positron? Not in any ordinary sense of 
the word. There is no way the person monitoring the electron detector could use his 
measurement to send a signal to the person at the positron detector, since he does 
not control the outcome of his own measurement (he cannot make a given electron 
come out spin up, any more than the person at X can affect the passing shadow of 
the bug). It is true that he can decide whether to make a measurement at all, but the 
positron monitor, having immediate access only to data at his end of the line, cannot 
tell whether the electron was measured or not. For the lists of data compiled at the 
two ends, considered separately, are completely random. It is only when we compare 
the two lists later that we discover the remarkable correlations. In another reference 
frame, the positron measurements occur before the electron measurements, and yet 
this leads to no logical paradox—the observed correlation is entirely symmetrical in 
its treatment, and it is a matter of indifference whether we say the observation of 
the electron influenced the measurement of the positron, or the other way around. 
This is a wonderfully delicate kind of influence, whose only manifestation is a subtle 
correlation between two lists of otherwise random data. 

We are led, then, to distinguish two types of influence: the “causal” variety, 
which produce actual changes in some physical property of the receiver, detectable by 
measurements on that subsystem alone, and an “ethereal” kind, which do not transmit 
energy or information, and for which the only evidence is a correlation in the data 
taken on the two separate subsystems—a correlation which by its nature cannot be 
detected by examining either list alone. Causal influences cannot propagate faster 
than light, but there is no compelling reason why ethereal ones should not. The 
influences associated with the collapse of the wave function are of the latter type, and 
the fact that they “travel” faster than light may be surprising, but it is not, after all, 
catastrophic. 9 


A. 3 What is a Measurement? 


The measurement process plays a mischievous role in quantum mechanics: It is here 
that indeterminacy, nonlocality, the collapse of the wave function, and all the atten¬ 
dant conceptual difficulties arise. Absent measurement, the wave function evolves in 
a leisurely and deterministic way, according to the Schrodinger equation, and quan¬ 
tum mechanics looks like a rather ordinary field theory [much simpler than classical 


9 An enormous amount has been written about Bell’s theorem. My favorite is an inspired essay 
by David Mermin in Physics Today (April 1985, page 38). An extensive bibliography will be found in 
L. E, Ballentine, Am. J. Phys. 55, 785 (1987). 
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electrodynamics, for example, since there is only one field (4'), instead of two (E 
and B), and it’s a scalar ]. It is the bizarre role of the measurement process that gives 
quantum mechanics its extraordinary richness and subtlety. But what, exactly, is a 
measurement? What makes it so different from other physical processes? 10 And how 
can we tell when a measurement has occurred? 

Schrodinger posed the essential question most starkly, in his famous cat para¬ 
dox: 11 


A cat is placed in a steel chamber, together with the following hellish contraption 
.... In a Geiger counter there is a tiny amount of radioactive substance, so tiny 
that maybe within an hour one of the atoms decays, but equally probably none 
of them decays. If one decays then the counter triggers and via a relay activates 
a little hammer which breaks a container of cyanide. If one has left this entire 
system for an hour, then one would say the cat is living if no atom has decayed. 
The first decay would have poisoned it. The wave function of the entire system 
would express this by containing equal parts of the living and dead cat. 


At the end of the hour, the wave function of the cat has the schematic form 


f = 7= (V^alive + lAdead). 

V2 


[A.13] 


The cat is neither alive nor dead, but rather a linear combination of the two, until a 
measurement occurs—until, say, you peek in the window to check. At that moment 
your observation forces the cat to “take a stand”: dead or alive. And if you find it to 
be dead, then it’s really you who killed it, by looking in the window. 

Schrodinger regarded this as patent nonsense, and I think most physicists would 
agree with him. There is something absurd about the very idea of a macroscopic object 
being in a linear combination of two palpably different states. An electron can be 
in a linear combination of spin up and spin down, but a cat simply cannot be in a 
linear combination of alive and dead. How are we to reconcile this with the orthodox 
interpretation of quantum mechanics? 

The most widely accepted answer is that the triggering of the Geiger counter 
constitutes the “measurement,” in the sense of the statistical interpretation, not the 
intervention of a human observer. It is the essence of a measurement that some 
macroscopic system is affected (the Geiger counter, in this instance). The measure¬ 
ment occurs at the moment when the microscopic system (described by the laws of 


l0 There is a school of thought that rejects this distinction, holding that the system and the measur¬ 
ing apparatus should be described by one great big wave function which itself evolves according to the 
Schrodinger equation. In such theories there is no collapse of the wave function, but one must typically 
abandon any hope of describing individual events—quantum mechanics (in this view) applies only to 
ensembles of identically prepared systems. See, for example, Philip Pearle Am. J. Phys. 35, 742 (1967), 
or, more recently, Leslie E. Ballentine, Quantum Mechanics, (Prentice Hall, Englewood Cliffs, NJ, 1990). 

n E. Schrodinger, Naturwiss. 48,52 (1935); translation by Josef M. Jauch, Foundations of Quantum 
Mechanics, (Reading, MA: Addison-Wesley, 1968), p. 185. 
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quantum mechanics) interacts with the macroscopic system (described by the laws of 
classical mechanics) in such a way as to leave a permanent record. The macroscopic 
system itself is not permitted to occupy a linear combination of distinct states. 12 

I would not pretend that this is an entirely satisfactory resolution, but at least 
it avoids the stultifying solipsism of Wigner and others, who persuaded themselves 
that it is the intervention of human consciousness that constitutes a measurement in 
quantum mechanics. Part of the problem is the word “measurement” itself, which 
certainly carries an suggestion of human involvement. Heisenberg proposed the word 
“event”, which might be preferable. But I’m afraid “measurement” is so ingrained 
by now that we’re stuck with it. And, in the end, no manipulation of the terminology 
can completely exorcise this mysterious ghost. 


A.4 The Quantum Zeno Paradox _ 

The collapse of the wave function is undoubtedly the most peculiar feature of this 
whole story. It was introduced on purely theoretical grounds, to account for the fact 
that an immediately repeated measurement reproduces the same value. But surely 
such a radical postulate must carry directly observable consequences. In 1977 Misra 
and Sudarshan 13 proposed what they called the quantum Zeno effect as a dramatic 
experimental demonstration of the collapse of the wave function. Their idea was to 
take an unstable system (an atom in an excited state, say) and subject it to repeated 
measurements. Each observation collapses the wave function, resetting the clock, 
and it is possible by this means to delay indefinitely the expected transition to the 
lower state. 14 

Specifically, suppose a system starts out in the excited state ^2, which has a 
natural lifetime r for transition to the ground state tfr j. Ordinarily, for times sub¬ 
stantially less than r, the probability of a transition is proportional to t (see Equa¬ 
tion 9.42); in fact, since the transition rate is 1/r, 

= -. [A. 14] 

T 

If we make a measurement after a time t, then, the probability that the system is still 
in the upper state is 

P,(0 = [A.15] 


12 Of course, in some ultimate sense the macroscopic system is itself described by the laws of 
quantum mechanics. But wave functions, in the first instance, describe individual elementary particles; the 
wave function of a macroscopic object would be a monstrously complicated composite, built out of all the 
wave functions of its 10 23 constituent particles. Presumably somewhere in the statistics of large numbers 
macroscopic linear combinations become extremely improbable. 

13 B. Misra and E. C. G. Sudarshan, J. Math. Phys. 18, 756 (1977). 

,4 This phenomenon doesn’t have much to do with Zeno, but it is reminiscent of the old adage “a 
watched pot never boils,” so it is sometimes called the watched pot effect. 
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Suppose we do find it to be in the upper state. In that case the wave function collapses 
back to ^ 2 , and the process starts all over again. If we make a second measurement, 
at 2 1, the probability that the system is still in the upper state is evidently 

H ) 2 *1 - 7 ’ [A16] 

which is the same as it would have been had we never made the measurement at t. 
This is certainly what one would naively expect; if it were the whole story there would 
be nothing gained by observing the system, and there would be no quantum Zeno 
effect. 

However, for extremely short times, the probability of a transition is not pro¬ 
portional to t, but rather to t 2 (see Equation 9.39) 15 : 

Pi^=at 2 . [A.17] 

In this case the probability that the system is still in the upper state after the two 
measurements is 

(l - at 2 ) 2 « 1 - 2at 2 , [A. 18] 

whereas if we had never made the first measurement it would have been 

1 - a(21) 2 ^ 1 - 4at 2 . [A.19] 

Evidently our observation of the system after time t decreased the net probability of 
a transition to the lower state! 

Indeed, if we examine the system at n regular intervals, from t = 0 out to t = T 
(that is, we make measurements at T/n,2T/n, 3T/n , ..., T), the probability that 
the system is still in the upper state at the end is 

(1 - a(T/nf) n ^ 1 - - T 2 , [A.20] 

v n 

which goes to 1 in the limit n —> co: A continuously observed unstable system never 
decays at all! Some authors regard this as an absurd conclusion, and a proof that 
the collapse of the wave function is fallacious. However, their argument hinges on a 
rather loose interpretation of what constitutes “observation.” If the track of a particle 
in a bubble chamber amounts to “continuous observation,” then the case is closed, for 
such particles certainly do decay (in fact, their lifetime is not measureably extended 
by the presence of the detector). But such a particle is only intermittently interacting 
with the atoms in the chamber, and for the quantum Zeno effect to occur the successive 
measurements must be made extremely rapidly to catch the system in the t 2 regime. 


15 In the argument leading to linear time dependence, we assumed that the function sin 2 (ftt/2)/£2 2 
in Equation 9.39 was a sharp spike. However, the width of the “spike” is of order A co = 4jt jt, and for 
extremely short f this approximation fails, and the integral becomes {t 2 j 4) J p(co)dco. 
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As it turns out, the experiment is impractical for spontaneous transitions, but it 
can be done using induced transitions, and the results are in excellent agreement with 
the theoretical predictions. 16 Unfortunately, this experiment is not as compelling a 
confirmation of the collapse of the wave function as its designers hoped; the observed 
effect can be accounted for in other ways. 17 


In this book I have tried to present a consistent and coherent story: The wave 
function (T') represents the state of a particle (or system); particles do not in general 
possess specific dynamical properties (position, momentum, energy, angular mo¬ 
mentum, etc.) until an act of measurement intervenes; the probability of getting a 
particular value in any given experiment is determined by the statistical interpreta¬ 
tion of ; upon measurement the wave function collapses, so that an immediately 
repeated measurement is certain to yield the same result. There are other possible 
interpretations—nonlocal hidden variable theories, the many worlds picture, en¬ 
semble models, and others—but I believe this one is conceptually the simplest , and 
certainly it is the one shared by most physicists today. It has stood the test of time, 
and emerged unscathed from every experimental challenge. But I cannot believe this 
is the end of the story; at the very least, we have much to learn about the nature of 
measurement and the mechanism of collapse. And it is entirely possible that future 
generations will look back, from the vantage point of a more sophisticated theory, 
and wonder how we could have been so gullible. 


16 W, M, Itano, D. J. Heinzen, J. J. Bollinger, and D. J. Wineland, Phys. Rev. A 41, 2295 (1990). 
17 L, E, Ballentine, Found Phys. 20, 1329 (1990); T. Petrosky, S. Tasaki, and I. Prigogine, Phys. 
Lett. A 151, 109 (1990). 
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tunneling, 280-84 
Watched pot effect, 383 
Wave function, 1-2, 11, 20, 22 

collapse, 4, 158-59, 374-75, 381-82, 385 
delta function well, 55 
finite spherical well, 132 
finite square well, 61 
harmonic oscillator, 35, 41 
helium, 187 
hydrogen, 139 
infinite spherical well, 132 
infinite square well, 26 
three-dimensional, 231 
Wave packet, 46-47 
Gaussian, 50, 69, 112 
minimum-uncertainty, 112 
velocity of, 48 
traveling, 69 




394 Index 


Wave vector, 194 
Wavelength, 16 
Wave-particle duality, 374 
White dwarf, 218-19 
Wien’s displacement law, 217 
Wigner, Eugene, 5, 383 

Y 

Yukawa potential, 271, 370 


z 

Zeeman effect, 244-50 
intermediate field, 248^-9 
strong field, 246-47 
weak field, 245^16 
Zeno effect, 383-85 
Zero: 
matrix, 88 
vector, 76 

Zeta function, 108, 216 



